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THE  ROLE  or  THE  mNICOWPOTER  IM  THE  IMFX3RMATIOW  RETRIEVAL  BUSIMESS 


B.K.  Penney 

Oepartnent  of  Cooputlng  and  Control, 
In^erlal  College  of  Science  and  Technology, 
London  SH7  ILU 
England. 


SUMMARY 


Various  aspects  of  the  use  of  conputers  in  infoiwtion  retrieval 
are  examined  to  determine  which  areas  may  be  better  or  more 
profitably  served  by  using  miniconvuters  rather  than  the  more 
usual  mainframe  system. 


I.  IMTRODUCTIOM 

Before  examining  the  different  aspects  of  information  retrieval  there  is  a discussion  of 
the  difference  in  characteristics  between  mlnlcoiputers  and  maxlcoiputers  (also  called  mainframes).  The 
first  aspect  of  information  retrieval  to  be  discussed  is  the  coo^uter  network;  in  particular,  the 
cosmunlcations  subsystem  of  it.  Next  is  the  data  entry  process;  this  is  fundamental  to  the  creation  and 
maintenance  of  a data  base  and  minicoit>uters  have  many  features  which  make  them  suitable  for  controlling 
this  task.  The  information  retrieval  system  Itself  is  discussed,  both  on-line  interactive  versions  and 
off-line  or  batch  types.  One  specific  application  considered  is  the  use  of  a mlnicotiputer  to  replace 
a simple  terminal  connected  to  one  or  more  remote  data  bases  via  a network.  This  approach,  although 
Involving  higher  initial  expend! ttire,  does  confer  some  advantages.  There  is  also  some  discussion  of  the 
use  of  distributed  systems. 

II.  CHARACTERISTICS  OF 'MINICOMPUTERS  AND  MAXICOMPUTERS. 

1.  Introduction; 

Minicomputera  and  maxicooputers  have  characteristics  which  differ  from  each  other  strongly  in 
some  respects.  Most  attespts  to  define  a mlnlcoiputer  start  with  discussions  of  maximum  memory  size, 
disk  storage,  software  and  other  factors,  and  finish  by  making  a definition  based  solely  on  cost.  As  it 
is  not  the  Intention  in  this  paper  to  become  Involved  in  these  usually  fruitless  discussions,  I will  not 
attempt  a definition  but  will  review  some  of  the  differences  between  minlconputers  and  maxicosputers. 

2.  Cost; 

The  most  striking  difference  is  cost.  A minicoaputer  with  disk,  line  printer,  teletype  and 
64K  bytes  of  umln  store  will  cost  in  the  region  of  £20,000  - £30,000.  Amortized  over  5 years  and  taking 
into  accomt  service  charges,  supplies,  etc.,  this  would  represent  a monthly  cost  of  fSOO-iSOO.  A main- 
frame system,  say  a middle  size  IBM  370,  would  cost  from  iO.SM  t^ards.  Amortizing  this  cost  over  five 
years,  and  adding  all  associated  costs  of  staff,  maintenance,  building,  air  conditioning,  svpplles,  etc., 
the  corresponding  monthly  costs  would  be  vpwards  of  120,000.  Of  course,  the  big  machine  has  many  more 
resources  than  the  smaller  machine  and  airports  more  users.  The  basic  question  is  whether  one  has  a 
better  deal  using  all  the  resources  of  a small  machine  or  a small  share  of  a big  machine. 

One  of  the  more  striking  facets  of  cost  which  support  use  of  big  coiputers  is  the  cost  of  disk 
storage.  The  cost  per  bit  of  disk  storage  decreases  as  the  size,  or  capacity,  of  a disk  pack  increases. 
The  typical  maxicosputer  disk  system  will  be  several  times  che^er  in  terms  of  cost  per  bit  than  the 
typical  mlnlcoiputer  system.  It  is  not  simply  a case  of  transferring  the  big  disk  systems  onto  minl- 
coiputers.  The  computer  manufacturers  do  not,  for  marketing  as  well  as  for  technical  reasons,  support  the 
use  of  the  bigger  disk  systems  on  the  smaller  coiputers. 

3.  Software; 

One  of  the  major  differences  between  minlcoiputers  and  maxlcoiputers  is  the  provision  of  soft- 
ware. Minicomputers  have,  for  example,  traditionally  been  supplied  with  comparatively  primitive  operating 
systems,  often  just  a single-user  and,  possibly,  a foreground/background  system.  In  very  recent  years 
this  situation  has  been  improving  and  several  minicomputers  do  now  have  simple-minded  timesharing  systems. 
Programing  languages  for  minlcoiputers  are  mainly  restricted  to  Assembler,  Fortran  and  Basic  or  one  of 
its  variants,  and  often  to  subsets  of  these  only.  These  are  not  necessarily  the  best  languages  available 
for  developing  infonetiofi  retrieval  systems.  The  third  area  where  softwaie  is  lacking  for  minicomputers 
is  in  data  base  management  systems  where  often  only  a primitive  filing  system  is  provided.  The  reason  for 
these  deficiencies  is  mainly  financial.  The  table  below  shows  the  annual  revenue  for  some  of  the  msjor 
coq>uter  manufacturers.  It  may  be  seen  that  the  largest  of  the  minicomputer  manufacturers.  Digital 
Equlpmsnt  Corporation,  has  a revenue  soma  30  tlmss  less  than  the  largest  of  the  mainframe  manufacturers, 
IBM.  As  a majority  of  the  cost  in  designing  and  Implemsntlng  computer  systems  lies  in  the  software,  it 
is  clear  that  there  are  nowhere  near  the  same  financial  resources  avall^le  to  develop  minicomputer  soft- 
ware as  there  are  for  maxlcoiputers. 


AmiUAL  SAUS 


$ MILLION 


IBM 

14,400 

Utalvac 

1,430 

Control  Data  Corporation 

1,363 

Honeywell 

1,324 

NCR 

950 

Digital  Equipment  Corporation 

480 

Mini 

ICL 

427 

Hewlett  Packard 

165 

Mini 

Data  General 

120 

Mini 

General  Automation 

75 

Mini 

Sources:  Dataaatlon  Septenber  1976 

Mackintosh  Consultants,  Mlcroconputer  Mlnlcoi^uter  Survey,  London,  1976. 


4.  Prograa  developaentt 

Progran  developnsnt  on  mlnlcoi^uters  also  differs  In  a nunber  of  respects.  Overall,  the 
limited  hardware  and  software  'resources  limit  the  speed  of  development,  particularly  the  edlt-coiplle-load- 
execute  cycle.  Also,  the  more  limited  subroutine  and  program  libraries  may  require  more  software  to  be 
written  on  the  mlnlcoi^uter  than  on  a maxlcosputer  with  more  extensive  libraries. 

On  all  but  the  larger  mlnlcoiputers  there  may  be  Insufficient  resources  to  allow  testing  and 
devel<^ment  to  proceed  concurrently  with  production  rtjnnlng. 

5.  Availability; 

The  availability  of  any  coii(>uter  system  Is  limited  by  the  amount  of  time  scheduled  for  systems 
maintenance  and  development  facilities.  Most  big  mainframes  require  several  hours  of  dedicated  prime 
shift  time  per  week  for  these  activities,  whereas  most  mlnlcoiputers  require  practically  no  time  at  all 
after  Initial  development.  , 

6.  Reliability : 

All  coiputers  are  likely  to  have  faults  In  both  hardware  and  software.  Hardware  faults  are 
usually  self  evident  Inasmuch  as  the  conputer  stops,  or,  for  exaaple,  a disk  suffers  a head  crash.  Soft- 
ware errors  constitute  a bigger  problem  as  their  efftcts  are  not  always  Immediately  self  evident  and  are 
thus  harder  to  locate  and  correct.  As  the  size  and  conplexity  of  software  increases,  so  do  the  difficul- 
ties of  ensuring  its  correctness,  of  maintenance  and  of  development.  It  often  happens  that  big  (grating 
systems  are  released  containing  as  many  as  a thousand  errors.  Sieg)ly  by  virtue  of  its  small  size  and 
relative  slipllclty,  reliability  Is  potentially  better  on  mlnlcosputers  than  on  big  mainframes.  Mainframes 
tend  to  have  better  hardware  fault  detection  features  (e.g.  the  IBM  370  series)  than  on  minlcoeputers , 
which  helps  to  reduce  consequential  down  time.  With  software,  however,  there  are  no  corresponding  error 
detection  ^clUtles  and  the  odnlcoaputee  Is  Intrinsically  in  a much  stranger  position.  It  is  not 
unconmon  for  minlcoi^uters  to  rm  for  snnths  without  a system  crash,  while  with  mainframes,  system  faults 
at  a rate  in  excess  of  one  a day  are  not  uncommon. 

7.  Perswmel; 

Systeam  software  programsmr  stg>port  on  a mlnlcos|>uter  may  be  as  little  as  one  person  (or  none 
at  all)  as  minlcosputer  vendors  tend  to  have  operating  systems  that  are  relatively  static.  Contrast  this 
with  the  typical  big  machine  Installation  where  the  Incorporation  of  software  patches,  updates,  and  new 
releases  are  a continuing  activity  throu^iout  the  lifetime  of  the  system  and  require  many  personnel  to 
perform  these  tasks.  In  addition,  mainframe  systems  commonly  have  one  or  more  resident  engineers  to 
maintain  the  hardware. 

I 

III.  NFTWORK  COMMimCATIOMS. 

1.  There  an  a large  nunber  of  national  and  Inteinatlonal  ooiputer  and  terminal  coimaunlcatlons 
networks  in  the  world  today.  It  Is  generally  accepted  that  the  communications  systems  should  be  driven  by 
mlnlcoeputers  and  that  any  users  should  be  connected  to  the  network  via  a standard  mlnicoiputer  (known 
generally  as  a netwoik  processor)  provided  by  the  network.  This  philosophy  has  been  followed  from  the 
beginning  of  modem  history  (ARPANET  (1))  to  the  latest  networks  such  as  the  new  European  EIN  (3)  network. 
In  ARPANET  (2),  for  example,  there  are  two  types  of  minlcosputers  used  as  network  processors.  The  Iff 

( Interface  Message  Processor)  can  Interface  tp  to  four  geographically  close  HORS  (user  co^uters ) to  the 
network,  and  the  TIP  (Terminal  Interface  Processor)  can  Interface  to  63  terminals  without  those 
terminals  needing  to  be  attached  to  a user  coiputer.  These  were  originally  based  on  Honeywell  DDP  516 
mlnlcoi^uters.  Figure  1 shows  a schematic  of  an  ARPANET  type  scheme.  The  reasons  for  this  approach  are 
discussed  In  the  following  paragraphs. 

2.  Mlnlcoeputers  are  of  coeparable  power  to  mainframes  at  handling  character  oriented  Input  output, 
and  are  thus  able  to  do  the  job  as  well.  In  fact,  the  nlnlco^>uter  will  be  doing  only  this  one  job  so  l.s 
performance  should  be  better  and  more  consistent  than  a mainframe  which  Is  doing  several  other  jobs 
simultaneously . 

3.  If  the  coeawnicatlons  handling  was  dona  In  the  mainframe,  a cosplete  and  very  coeplex  program 
would  be  required  for  each  type  of  mainframe  In  the  network.  The  program  development  would  thus  be 
e:pensive  and  time  constmlng  and  would  virtually  eliminate  networks  of  dissimilar  coeputers.  Putting  the 
conminlcatlons  handling  Into  the  minlcoiputar  means  that  only  one  type  of  program  Is  needed  for  the  lower 
levels  of  the  cosBunlcations  protocol.  Including  store  and  forward  activities,  and  more  effort  may  be 
expended  In  getting  It  right.  Thera  must  still  be  a program  In  the  users  conputer  In  order  to  commmlcata 
with  the  network  machine,  but  that  program  will  be  very  much  slnpler  than  If  all  levels  of  network 
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4.  Putting  the  communications  handling  into  a separate  miniconputer  reduces  the  load  on  the  user 

comiuter.  High  speed  character  conmunications  and  message  switching  is  a heavy  load  on  a coiputer  and  the 
required  software  occijpies  a substantial  amount  of  main  store.  Offloading  it  on  to  a minicosputer  is 
clearly  cost  effective  as  minicoitputer  hardware  is  a great  deal  cheaper  than  mainframe  hardware. 

5^  The  minicoiputer  isolates  the  user  con>uter  from  the  communications  network,  hence  any  error 

by,  or  failure  of,  the  mainframe  cannot  interfere  with  the  rest  of  the  network.  The  greater  reliability 
of  a minicomputer  will,  in  a store  and  forward  network,  give  greater  reliability  for  the  communications 
system  as  a whole  as,  even  if  the  user  computer  is  not  working,  the  minicomputer  (network  processor)  will 
still  function  and  pass  on  messages  for  other  nodes  in  the  network.  On  ARPANET  (1),  for  example,  total 
up  time  of  an  IMP  averages  99%,  while  total  up  time  for  a HOST  does  not  generally  exceed  90%. 

6.  Overall  this  use  of  minicomputers  as  network  processors  seems  to  be  entirely  justifiable  and 

sensible,  and  there  is  no  reason  why  this  situation  should  change. 

7^  An  interesting  exception  to  this  is  DEC*s  new  network  architecture,  called  DECNET,  (4),  In 

this  system  different  computers  are  linked  together  directly  without  using  front  end  processors.  The 
computers  linked,  althou^  different  (PDPll,  PDPIO,  POPS),  are  all  made  by  the  same  firm  and  as  such  this 
is  a restricted  case  and  not  a general  purpose  heterogeneous  network. 


IV.  DATA  ENTRY. 

1.  The  creation  and  maintenance  of  a data  base  can  be  carried  out  completely  off  line  using  paper 

tape,  punched  cards  or  some  magnetic  medium.  The  data  is  keyed  on  to  the  medium  and  transferred  to  a 
remote  computer  to  be  processed  off  line  as  part  of  a batch  stream. 


■ The  disadvantages  of  this  approach  in  terms  of  handling  difficulties  (especially  paper  tape!), 

' error  rates  and  turnaround  are  well  known  to  everyone  who  has  had  to  prepare  data  in  this  manner.  Paper 

J tape  is  the  worst  to  handle,  of  course,  while  punched  cards,  althougji  easier  to  handle,  suffer  from  not 

1 having  the  ability  to  record  lower  case  characters  directly  without  the  use  of  escape  codes.  This  is 

jj  particularly  important  for  bibliographic  data. 

J 2^"  The  alternative  to  off  line  data  entry  is  on  line  data  entry.  Where  an  organisation  does  not 

I have  a mainframe  in  house  it  can  use  tenminals  which  are  connected  either  to  a reimote  timesharing  system 

or  to  a local  in  house  minicomputer.  ' 

The  arguiments  for  having  data  en-  done  on  line  with  an  interactive  system  are  that 

a)  The  data  can  be  validated  and  mistakes  corrected  straight  away. 

b)  The  data  can  be  displayed  on  a screen  suitably  formatted  and  possibly  with  additional 
information. 

c)  No  handling  of  vulnerable  media  such  as  paper  tape  is  involved,  and  hence  no  errors  due  to 
mishandling  or  damage  can  occur. 

d)  Turnaround  is  shortened.  At  the  end  of  a session  data  can  be  copied  to  a majpetic  tape 
and  reports  printed  immediately,  rather  than  waiting  for  a period  of  up  to  several  days 
for  rem»te  batch  working. 


An  autoaatic  device  such  as  an  optical  character  reader  can  be  used. 

These  considerations  apply  to  both  ninlcoiputers  and  naxiconputers  equally. 
The  considerations  in  favour  of  the  minlcoiputer  epproach  are  as  follows:* 


e) 
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a)  If  t)ie  arount  of  data  entry  required  is  such  that  it  needs  more  than  a certain  nustwr 
of  taminal  connect  hours  per  week,  then  the  economics  will  favour  a local  mlnicoiputer. 

Where  the  crossover  point  lies  depends,  of  course,  on  charges  levied  for  use  of  the  time 
sharing  system,  telephone  charges  (if  the  coiputer  system  is  remote)  and  on  the  cost  of  the 
minlcoiiputer.  Back-of-envelope  calculations  suggest 

1)  that  the  crossover  point  would  lie  at  as  little  as  10  hours  per  week  even  if  the 
minlcoiputer  system  did  nothing  else,  and 

il)  that  if  the  amount  of  data  entiy  required  two  or  more  terminals,  then  the  mlnl- 
coiputer  approach  is  certainly  cheqier. 

b)  The  ergonomic  factors  are  likely  favour  a local  minlconputer.  Response  time  to  service 
one  line  of  data  input  must,  to  keep  the  operator  h^py,  be  less  than  about  half  of  a second. 
Any  miniconputer  can  achieve  this  figure  even  with  several  terminals  attached,  whereas  a 
general  purpose  time  sharing  system  may  well  have  response  times  which  are  both  long  and 
erratic;  a situation  which  is  very  frustrating  for  operators. 

c)  A minlcoiputer,  as  discussed  in  section  II,  will  have  more  ip  time  than  a mainframe  system. 
This  will  be  isportant  where  data  entry  is  required  for  extended  periods. 

At  the  U.K.  Defence  Research  Infotmatlon  Centre  (D.R.I.C.)  an  information  retrieval  project 
(5)  has  until  recently  been  run  on  a remote  batch  system  with  data  preparation  done  off  line  on  pq>er 
tape.  A minlcoiiputer  was  purchased  recently  to  take  over  some  of  the  functions  of  the  remote  system,  and 
one  of  the  more  significant  reasons  fcr  doing  this  was  to  liprove  the  data  entry  process. 


V.  THE  INFORMATION  RETRIEVAL  SYSTEM. 

1.  Basic  approaches. 

An  information  retrieval  system  can  be  Inplemented  using  one  of  three  basic  approaches. 

a)  It  can  use  a dedicated  mainframe  system 

b)  It  can  use  a share  of  a general  purpose  mainframe  system,  either  an  in  house  or  a remote 
bureau  system 

c)  It  can  use  a dedicated  minlcoiiputer  system. 

At  one  extreme  the  information  system  could  involve  very  large  databases,  extensive  usage  and, 
possibly,  many  interactive  terminals.  At  this  extreme  the  dedicated  mainframe  system  has  been  used  as 
that  has  been  the  only  way  technically  of  providing  the  necessary  resources.  Such  an  approach  is  followed 
in  the  System  Development  Corporation  (6)  and  European  Space  Agency  (7)  systems,  for  exanple.  Each  of 
these  systems  contains  many  large  data  bases  and  each  allows  many  concia'rent  users  from  a large 
geographical  area. 

At  another  extreme  the  information  retrieval  system  may  consist  of  comparatively  small  data 
bases  and  involve  only  occasional  use.  In  these  circumstances  it  would  be  possible  to  use  either  of 
approaches  b)  or  c).  The  mlniconputer  costs  are  fixed  and  are  essentially  independent  of  the  amount  of 
usage.  In  contrast  a mainframe  system  is  normally  charged  for  according  to  the  amount  of  usage  Involved. 

For  occasional  use  the  fixed  mlniconputer  costs  would  be  higher  than  the  cost  of  using  a mainframe  system, 
and  there  would  thus  be  economic  advantages  to  using  the  mainframe  system. 

In  between  these  extremes  are  many  other  possible  circumstances,  and  the  cpproach  to  be  used 
has  to  be  considered  in  terms  of  cost,  ease  of  development,  ease  of  running,performance  and  reliability. 

The  two  approaches,  shared  use  of  a mainframe  and  exclusive  use  of  a mlniconputer,  are  discussed  in  the 
following  paragraphs. 

2.  Cost; 

If  the  purchase  cost  of  a mlniconputer  is  amortized  over  a period  of  time  then  the  resulting 
monthly  cost  is  to  be  conpared  with  rental  of  time  on  a bureau  nmchlne.  Hcivor  has  shown  that  in  one 
case  (8)  the  conparatlve  costs  of  running  the  same  application  on  a bureau  machine  and  on  an  in-house 
mlniconputer  is  five  to  one,  a result  which  so  clearly  favours  use  of  a mlniconputer  that  one  wonders  why 
many  more  users  don't  take  this  ^proach.  It  may,  perhaps,  be  because  many  large  bodies,  such  as 
universities  and  defence  organisations,  provide  use  of  an  in  house  or  bureau  mainframe  machine  'free'  to 
internal  users,  whereas  the  internal  users  would  have  to  put  a strong  case  for  acquiring  a new  minlcoiiputer. 


For  on-line  use,  comnunlcatlons  charges  arise  if  the  terminals  are  remote  from  the  conputer.If 
the  conputer  used  is  a minicomputer  and  the  terminals  are  clustered,  then  it  should  be  possible  to  set 
both  in  the  same  place,  thereby  avoiding  telephone  costs  conpletely. 


An  unejqiected  penalty  faced  one  prospective  on-line  user  (10) 
connected  wt-line  to  a private  data  base  for  several  hours  each  day.  The 
sufficiently  long  by  the  operators  of  bureaux  that  he  was  asked  in  effect 
his  data  base.  This  user  did  , in  fact,  decide  to  implement  his  system  on 


in  the  U.K.  who  wished  to  be 
on-line  period  was  considered 
to  buy  the  disk  drive  which  held 
a dedicated  minlcoiiputer. 
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3. 


Performance ; 


For  off-line  processing  the  user  will  not  usually  be  concerned  with  factors  such  as  speed  of 
processing  or  operational  problems.  His  main  concein  will  usually  be  to  achieve  a reliable , short,  turn- 
around regardless  of  the  t<."j  the  job  spends  In  the  machine.  While  a mlnlcon(>uter  may  well  take  somewhat 
longer  to  run  a given  job  than  a maxiconputer,  the  mlnlcoig>uter  may  be  able  to  start  processing  the  job 
sooner  due  to  the  different  operational  environment.  Overall,  for  batch  type  processing,  the  most  signi- 
ficant factor  will  be  whether  the  system  used  Is  In  house  or  not  rather  than  whether  the  system  Is  based 
on  a mlniconputer  or  maxiconputer.  For  jobs  which  are  not  run  on  demand  but  on  a periodic  basis,  such  as 
monthly  SDI  lists,  there  should  be  no  difference  to  the  user  at  all. 

For  on-line  working  the  minicomputer  might  be  able  to  give  superior  performance  to  that  of  a 
remote  mainframe  system.  This  arises  from  the  superior  terminal  handling  properties  of  the  mlnlconputer 
when  executing  only  the  Interactive  program.  That  Is,  one  program  with  two  or  three  terminals  running  on 
a minlcooputer  will  give  a better  performance  than  a general  purpose  time  sharing  system  running  a hundred 
terminals  for  diverse  tasks.  If  the  mainframe  coitputer  was  dedicated  to  the  Information  retrieval  system 
then,  of  course.  Its  greater  total  resources  would  inevitably  give  better  performance  than  would  a mlnl- 
coiputer  system. 


4.  Reliability: 


For  batch  running  the  main  effect  of  the  computer  being  out  of  action,  whether  due  to  scheduled 
maintenance  or  to  faults , is  to  Increase  the  delay  between  the  user  submitting  a job  and  receiving  the 
output.  This  might  be  Inconvenient  but  should  not  be  over  dlsriptlve  e.g.  if  a monthly  SDI  list  Is  a day 
late  that  will  probably  not  st<^  the  intended  recipient  working.  In  an  on-line  system  the  conputer  being 
down  does  prevent  the  user  from  working  to  schedule,  and  this  Is  likely  to  be  much  more  disruptive.  The 
greater  reliability  and  overall  availability  of  mlnlconputer  based  systems  should  give  a significant 
advantage  over  mainframe  based  systems,  especially  with  interactive  working. 

5.  Security /Privacy : 

If  questions  of  privacy  (ie  unauthorised  access)  arise,  then  clearly  having  the  conputer  system 
on  site  and  dedicated  to  that  single  application  Is  a very  great  advantage,  and  obviously  a mlnlconputer 

is  suitable  for  this.  The  fact  that  it  requires  fewer  personnel  to  develop  and  operate  the  system  is 

another  advantage. 

6.  The  library  Information  retrieval  system  Is  one  area  which  seems  well  suited  to  mlnlconputer 

use.  A library  which  uses  an  on-line  interactive  system  needs  it  to  be  available  throughout  the  working 
day.  If  the  level  of  use  is  sufficiently  high  then  the  cost  of  using  a mainframe  system  may  exceed  the 
fixed  costs  of  using  a minicomputer.  In  a paper  on  the  proposed  BLIPS  system  (9)  for  special  libraries 

it  is  shown  that  a database  which  accommodates  the  equivalent  of  50,000  library  items  would  require  SOM 

characters  of  disk  space,  an  amount  which  can  be  supplied  on  one  disk  pack  on  modern  systems.  For  example, 
the  CAIRS  (10)  mlnlconputer  based  system  Is  in  use  by  a small  Industrial  research  association  In  the  UK 
and  is  a current  and  retrospective  information  system  which  offers  the  usual  facilities  such  as  SDI  list 
production,  bibliography  and  catalogue  generation,  with  on-line  interactive  searching  and  data  entry.  The 
system,  shown  schematically  in  figure  2,  is  based  on  a Texas  Instruments  980A  mlnlconputer  with  24K  words 
of  roainstore,  two  magnetic  tape  drives,  two  disk  drives,  a VDU,  printer  and  control  teletype.  With  the 
two  disks  the  data  base  can  hold  vq>  to  200,000  document  references.  This  system  was  designed  and 
implemented  in-house  and  has  been  sufficiently  successful  that  It  Is  now  being  marketed  as  a general 
system. 


The  CAIRS  Mlnlconputer  based  Information  retrieval  system. 
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Distributed  Systems: 


The  distributed  Information  system  has  the  following  principles  - 

a)  There  are  several  coii{)uters  In  the  total  system,  all  connected  In  some  way  to  each  other. 
Data  is  stored  at  the  most  appropriate  geographical  location  according  to  measures  such  as  most  connon 
use  or  cheapness  of  storage. 

b)  In  operation  most  requests  at  a particular  site,  whether  generated  on-line  or  off- line, would 
be  satisfied  by  reference  to  the  local  data  base. 

c)  That  minority  of  requests  which  could  not  be  satisfied  locally  would  be  met  by  issuing 
requests  to  the  remote  coaputera. 

One  particular  configuration  of  a distributed  system  is  the  hierarchical  system  shown 
schematically  in  figure  3. 


FIGURE  3 

Hierarchical  distributed  system. 

Here,  each  of  the  remote  sites  could  be  based  on  a dedicated  minlconputer,  each  holding  information 
pertinent  to  local  needs;  for  exan^le,  stock  lists  of  car  doors  at  one  site,  of  engines  at  another,  and 
of  steering  assemblies  at  another.  The  central  site  could  hold  a conplete  set  of  all  information  or 
summary  information  only. 

The  advantages  of  the  distributed  approach  are,  potentially  at  least,  as  follows  : 

a)  The  communications  costs  are  greatly  reduced  by  making  the  majority  of  information  available 
locally.  As  communications  costs  can  account  for  a significant  proportion  of  the  total  costs,  this 
reduction  may  be  very  sl^ilflcant. 

b)  The  performance  for  each  user  should  be  better  than  in  the  non-dlstrlbuted  system  as  each  site 
is  supplying  a ftnctlon  of  the  users  from  a fraction  of  the  data  base. 


c)  The  total  dependence  on  the  central  coaputer  is  reduced.  Now,  if  any  one  site  is  out  of 
service,  the  effect  on  other  sites  is  either  nil  or  a partial  reduction  of  service;  the  majority  of  the 
processing  should  be  able  to  continue. 

d)  The  processing  load  on  the  central  site  is  reduced. 


VI.  THE  mHICOHPUTER  AS  A TERMHAL. 


In  accessing  an  information  retrieval  system  it  is  conaon  to  use  a simple  terminal  such  as 
a VDU,  possibly  with  a slow  prlntsr.  An  alternative  to  use  of  such  a slaple  terminal  would  be  to  c«inect 
the  taradnal  dsvlcss  via  a minicomputer.  The  adnicoaputer  could  be  equipped  with  a aaiall  disk.  In 
operation  the  opsrator  would  sit  at  ths  control  devica , possibly  a VDU,  and  issue  commands  in  the  usual 
way  and  the  adnlcoaputer  would  pass  on  these  coanands  to  the  infbtiaatlon  retrieval  system.  The  messages 
from  the  infomatloo  system  to  the  terminal  could  bo  stored  on  the  mlnlcoaputar's  disk  before  printing. 
The  potential  advantages  of  this  approach  are  as  follows. 


a)  Many  Information  systems  are  accessible  to  any  particular  user  for  a limited  period  of  each 
day.  For  exasple,  the  'window'  for  UK  users  of  the  System  Development  Con>oratlon  system  is  about  half  of 
normal  working  hours.  If  infonnation  can  be  transmitted  rapidly  to  the  mlnlconputer's  disk  it  could  then 
be  printed  later,  off-line.,thereby  freeing  the  available  on-line  tiiae  for  more  productive  woi*. 

b)  The  costs  may  reduce  as  printing  which  is  to  be  done  at  a user  site  will  not  need  to  be  done 
during  eivettslve  on-line' time.  In  the  UK  use  of  information  systems  costs  to  ilOO  per  hour,  so  any 
reduction  in  on-line  time  will  be  quite  significant. 

c)  The  possibility  of  doing  more  printing  at  users'  sites  should  eliminate  postal  delays  for 
printing  which  would  otherwise  be  performed  at  the  information  retrieval  system  and  sent  on  throu{^  the 
posts. 

d)  If  the  results  of  a search  have  been  stored,  then  the  user  can  add  the  results  to  a personal 
data  file.  This  file  can  be  built  from  several  sources  and  as  the  result  of  many  searches,  and  could 
be  processed  further  in  many  ways.  This  processing  could,  for  exaiple,  be  further  selection,  report 
generation  or  even  confutation  on  numerical  data. 

These  personal  files  could  be  copied  onto  cassette  tape  or  floppy  disk  and  stored  securely 
and  privately  in  users  own  offices. 

e)  The  minlconfuter  could  support  use  of  several  terminals  concurrently,  both  for  accessing  its 
own  files  ('off-line'  mode)  and  for  accessing  a remote  information  retrieval  system  ('on-line'  mode). 

By  appropriate  software  this  on-line  use  could  be  made  to  appear  as  being  simply  more  intensive  use 
of  a single  terminal. 

f)  Not  all  networks  and  computing  systems  allow  use  of  compatible  terminals.  An  eeur'lier  paper  in 
this  conference  (11)  described  how  in  Sweden  it  was  necessary  to  keep  three  different  terminals  at  one 
site  in  order  to  access  three  different  systems.  If  access  to  networks  or  to  computer  systems  is  made 
via  a minicomputer,  then  the  minicomputer,  by  supporting  the  required  protocol  (implemented  in  software), 
could  allow  access  to  different  computers  and  networks  via  a single  set  of  hardware  (the  minicomputer  and 
terminals). 

In  order  to  evaluate  this  technique  an  e]q>eriisent  has  been  Initiated  in  the  Utalversity  of 
London  by  the  Central  Information  Services  unit  and  Imperial  College.  An  intelligent  terminal  (an  ' 

Incoterm)  will  be  sited  in  the  University  library  in  order  to  provide  access  to  reimote  data  bases.  This 
will  have  a floppy  disk  unit  attached.  The  disk  will  not  be  large  enough  to  store  all  data  required, 
nor  will  the  processor  or  main  store  be  sufficiently  powerful  to  do  all  the  processing  required.  Instead, 
the  termiinal  will  be  connected  via  a dial  vp  telephone  line  to  a GEC  4080  midi  computer  at  Imperial 
College,  as  shown  in  figure  4.  This  computer  will  supply  the  bulk  disk  storage  and  processing  power  required. 
It  is  Intended  that  this  system  should  be  in  use  for  tilo  years  starting  in  imld  1977.  The  Incoterm  terminal 
will  be  used  full  time  in  normal  working  hours,  but  the  GEC  4080,  which  is  used  for  other  applications,  will 
be  used  for  this  purpose  for  only  one  or  two  hours  per  day. 


nCURE  4 


Connection  of  intelligent  terminal,  to  reimote  minicomputer  for  evaluation  experiments. 


VII. 


CONCLOSKWS. 


he  mlnlcoiputer  is  a viable  and  desirable  alteraatlve  to  a mainframe  coiq>uter  in  several 

(but  not  a. . aspects  of  Information  retrieval  systems. 

a)  As  netwot4c  processors  In  the  communications  subsystem  of  any  cos(>uter  networ4c  their 
use  Is  already  widespread. 

b)  Where  the  Information  retrieval  requirements  are  small  enough  that  not  all  of  the 
resources  of  a mainframe  conputer  are  needed,  there  are  several  circumstances  where  a 
dedicated  mlniconputer  is  superior  to  a share  of  a mainframe  ^stem  in  terms  of  cost, 
performance  and  reliability. 

c)  For  interactive  on  line  data  entry  the  mlnlconputer  will  give  better  performance  than  a 
general  purpose  time  sharing  system,  and  If  the  number  of  connect  hours  Is  high  enough  a 
dedicated  minlconputer  can  be  cheeper  than  part  use  of  a mainframe. 

d)  In  the  replacement,  or  'front  ending',  of  single  terminals  connected  to  remote  information 
retrieval  systems,  they  offer  severed  advantages , both  economic  and  functicDal. 
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SIMURI 

After  outlining  the  functions  of  DRIC,  this  paper  describee 
the  use  of  a mini-oonputer  to  prepare  an  Abstracts  Bulletin 
and  its  indexes,  replaoing  an  earlier  partly-mechanised 
system,  and  to  prooide  data  on  the  exchange  of  reports  &ith 
foreign  countries.  Some  of  the  problems  encountered  during 
' inplementation  of  the  counter  systems  are  discussed, 

ij  Future  possible  applications  for  the  conputer  are  described. 

‘A  They  include  a repister  of  the  interests  of  WIC  's 

^ I customers,  a loans  control  system  particularly  for  classi- 

f ; fied  reports,  thesaurus  look-up  to  help  the  scientific 

staff,  and  information  retrieoal  (both  SDI  and  retrospective). 
’ Finally,  there  is  a brief  eumary  of  other  uses  of  computers 

in  the  UK  Ministry  of  Defence  information  and  library 
eeroices. 


INTRODUCTICT) 

1 . During  the  past  year  DRIC  has  obtained  a uni-co^uter  tdiich  is  now  used  to  pr^re  printing  masters 

j for  a twice-monthly  Abstracts  Bulletin  and  its  indexes.  It  also  provides  quantitative  information  on  the 

] exchange  of  reports  with  foreign  countries.  Later,  it  will  be  leed  to  maintain  a field  of  interest 

register  of  nUC’s  customers,  to  provide  loans  information  for  classified  reports,  printing  recall  rotices 
as  necessary,  and  to  provide  aids  for  the  subject  analysts  and  the  editors  of  the  Abstracts  Bulletin. 

2.  It  mi^t  help  to  set  the  scene  by  first  describing  briefly  the  origin  ^ functions  of  DRIC.  Ihe 
Defence  Resear^  Information  Centre  (DRIC)  is  the  central  facility  of  tte  Ministry  of  Defence  fbr  the 
acquisition,  announcesmnt  and  distribution  of  Defence-orientated  uqxiblished  scientific  and  technical 
reports  originating  in  the  UK  and  overseas  and  more  particularly  those  reports  which  are  Defence- 
controlled  or  classified.  DRIC  was  formed  in  1971  a merger  of  the  defence  element  of  the  former 
Ministry  of  Technology  Reports  Centre  and  the  Naval  Scientific  and  Technical  Informatim  Centre  (NSTIC) 
of  the  Ministry  of  Defence  Navy  Di^rtment.  Ihe  Mintech  Reports  Centre's  fmctions  were  describe  in  a 
paper  presented  by  Mr  S.C.  Schuler  to  the  TIP  Specialists  Meeting  in  Ottawa,  September  1969  (ref.l. 

Sec. 2).  NSTIC  had  a similar  role  with  regard  to  report-handling  and  also  acted  as  a central  library 
for  bMks  and  periodicals  and  ran  a translations  service  for  the  Navy  Department.  The  library  function 
has  been  taken  over  by  the  Ministry  of  Defence  Library  Service,  but  the  translations  service  is  still 
the  responsibility  of  DRIC.  The  non-defence  element  of  the  Mintech  Reports  Centre  now  forms  the 
Technology  Reports  Centre  (TBC)  of  the  Department  of  Industry.  The  changes  are  shown  diagiwmmtically  in 
Fig.  1 (on  following  page).  DRIC  and  IRC  may  be  considered  as  iqiproximately  equivalent  to  DOC  and  NTIS, 
respectively,  in  the  USA. 

3.  DRIC  provides  information  services  to  all  branches  of  the  Ministry  of  Defence,  their  contractors 
and  associates,  on  Defence  Classified  and  Controlled  reports,  as  well  as  certain  other  ancillary 
services.  DRIC  has  a special  responsibility  for  aerospace  information  and  is  the  UK  National  Distribu- 
tion Centre  for  AQVRD  documents.  DRIC's  staff  naber  92,  Including  19  scientists  and  6 ADP  specialists. 
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Fig.l.  The  FoTBiation  of  DRIC 


SERVICES  PROVIEED  BT  ERIC 

4.  DRIC  provides  5 aiain  services  and  a nnfcer  of  ancillary  ones: 

(a)  Master  Record  Centre  for  British  Defence  Reports 

Tlie  classification  and  Conditions  of  Release  of  British  Reports  are  checked  vdth  the  appropriate 
authorities  in  M3D  and  procedures  are  used  to  ensure  that  stipulated  lixdtations  on  release  are 
observed. 

Clearance  with  Patents  Branch  and  releasing  authorities  is  sought  as  qrpropriate.  Qassified 
and  other  controlled  reports  are  transnitted  to  and  fron  overseas  couitries  via  secure  channels. 

(b)  Abstracting  ^ s^fect  indexing  UK  and  foreign  Defend  reports  and  publishing 
a twice-monthly  Abstracts  Bulletin  toefence  Research  Abstr^ts  Ciiuv) ) 

Different  editions  of  IKA  are  published  for  MOD  and  Defence  Contractors.  Each  issue  contains 
subject,  author  and  report  nuiter  indexes,  and  DRIC  also  publishes  quarterly  and  annual  oaula- 
tive  indexes  containing  additionally  indexes  of  titles,  corporate  authors,  conference  psMrs, 
translations  and  contract  mariaers.  A five-year  oaulative  index  on  nicrofiche  prepared  by 
Coaixiter  Output  to  Microfbm  (COM)  will  be  published  later  this  year.  Magnetic  tapes  corres- 
ponding to  the  data  in  DM  in  both  serial  and  inverted  file  torm  will  shortly  be  sigiplied  each 
■onth  to  the  larger  RID  establishaents  to  enable  theai  to  nat  their  own  Selective  Dissenina- 
tion  of  Infbraation  (SDI)  services.  The  software  for  this  will  be  provided  by  DRIC. 

(c)  Distributing  th^  reports  to  the  Defence  Coi^ty  within  UK  and  fin  tte 
case  of  UlC-OTiglnated  reportsl  to  Defence  organisations  in  other  a^tries 

British  reports  are  distributed  according  to  recomaendations  by  the  originators  and  the 
Technical  Prograame  Authority  responsible  for  the  work;  foreign  reports  are  distributed 
according  to  any  instructions  acoapanying  the  reports.  However,  DRIC  often  adds  other 
recipients  within  MOD  in  the  light  of  the  subject  natter  and  the  known  interest  of  potential 
recipients,  depending  on  the  'conditions  of  release'. 


(d) 


r 


1 

U 


Resp(y.iding  to  requests  from  the  Defence  comity  for  such  reports 


Requests  my  be  met  by  loans,  by  retention  «»ies,  or  by  microfiche  (always  sent  for  retention),  I 

(landing  on  the  needs  of  the  requester  and  the  nimi>er  of  copies  available.  ERIC  also  atteq;>ts  \ 

to  <Atain  r^rts,  both  British  ^ foreign,  idiich  have  been  requested  but  have  not  previously  i 

been  received  by  ERIC.  ' 

(e)  Searching  the  report  literature  in  response  to  requests  for  infoimticm  by  members  of  MM)  or  by 
Eefence  Contractors^  ERIC  has  an  on-line  VEU  terminal  to  the  European  Space  Agency's  REOON 
system,  viiich  holds  NASA/STAR  aiKi  NTIS/GRA  files,  and  will  shortly  have  a similar  terminal  for 
searching,  back  to  1970,  data  from  BRA  vdiich  are  held  in  comiuter-readable  form  at  a M3D 
conputer  bureau.  Manual  searches  of  ERA  are  carried  out  at  present,  and  will  still  be  needed 
for  items  earlier  than  1970  and  other  data  bases.  ERIC  also  provides  a limited  monthly  SDI 
service  for  members  of  M9D,  based  on  the  reports  listed  in  BRA.  Throu^  the  agency  of  the 
Atomic  Weapons  Research  Establishment,  Aldeimaston,  a similar  service  is  provided  from  several 
of  the  mjor  data  bases  such  as  NTIS/GRA,  INSPBC  and  Chranical  Abstracts  Condensates. 

(f)  The  ancillary  services  include  translating  non-English  language  documents  for  R4D  establish- 
ments  and  sullying  foreign  military  specifications.  ERIC  holds  a cooplete  and  regularly  up- 
dated set  of  US  Military  ^iecifications  and  related  documents  on  microfilm  (about  105,000 
documents)  together  with  hard  copies  of  many  of  them  (mainly  defence  and  aerospace  items) . 

The  in-house  conputer  is  currently  used  for  items  a & b above,  and  may  be  used  for  item  e later. 

5.  The  main  flows  of  information  within  ERIC  are  shown  in  Fig. 2,  viiich  includes  some  statistics 
relating  to  operations  during  1975.  Thus  some  13,000  report  titles  were  handled,  5000  from  the  UK  and 
8000  from  abroad,  and  about  110,000  copies  of  reports  were  distributed.  Defence  Research  Abstracts 
contained  9000  items.  Requests  for  reports  numbered  about  40,000;  for  military  specifications  about 
8000;  and  for  literature  searches  and  translations,  about  600  each. 

USE  OF  THE  MDU-OOMPUTER  K«  THE  PRODUCTICTJ  OF  MFENCE  RESEARCH  ABSTRACTS  (ORA) 

6.  The  mini-conputer  at  MUC  (a  British  GEC  4080)  viiich  was  coranissioned  in  1975  is  shown  in  Fig.  3, 
and  its  main  features  are  listed  in  Fig.  4.  It  should  peih^s  more  properly  be  called  a midi-conputer 
as  it  has  more  manory  and  larger  discs  than  most  minis.  It  is  a 32-bit  viord  machine  with  (at  DRIC) 

64K  bytes  of  semi-conductor  store  and  19.2  megabytes  on  2 fixed  and  2 exchangeable  discs.  to  256K 
bytes  of  store  are  available  vd,th  one  CPU,  and  GEC  plan  in  future  to  sipport  at  least  4 x 70  megabytes 
of  disc  storage.  The  most  interesting  feature  of  the  conputer  is  the  •I'kicleus',  a hardvare  micro- 
prograaned  executive,  vdiich  manages  store  protection,  conminications  between  programs,  and  short-term 
schediling  of  programs  in  the  system.  This  enables  the  ccmputer  to  provide  extensivre  operating  system 
facilities  which  vanild  otherwise  be  impractical  because  of  high  overheads.  The  Disc  Operating  System 
(DOS)  includes  extremely  powerful  Data  Nbnagement  software  which  provides  device-independent  data  trans- 
mission to  and  from  the  peripherals  for  both  applications  and  operating  system  programs.  The  program 
does  not  need  to  specify  which  files  or  devices  are  to  be  used,  this  being  draie  at  run-time  by  a connand 
vdiich  selects  files  and  peripherals  for  connection  to  the  'streams'  (essentially  input  and  output  ports) 
of  a program.  Up  to  256  programs  or  processes  may  be  handled  c(H»anrrently.  MUC's  installation  has  3 
VDUs  and  an  ipper  and  lower  case  line-printer.  Languages  currently  siqjported  are  FORTRAN  IV,  CORAL  66, 
and  GEC's  own  language,  Babbage  - an  assenibly  language  voritten  in  the  form  of  a high-levrel  language  and 
in  consequence  such  easier  to  use  than  most  assembly  languages.  Interactive  Basic  and  Algol  60  will 
also  be  si^ported  under  a new  operating  systan  recently  announced  by  GEC.  Other  organisations  such  as 
universities  have  vn-itten  conpilers  for  other  languages. 

7.  To  understand  how  this  conputer  is  used  for  the  production  of  DRA  it  might  be  helpful  first  to 
sumnarise  the  way  in  vdiich  ERA  vas  prepared  until  the  end  of  1975.  This  is  shown  diagramraatically  in  Fig.  5a 
(p2-6)  and  is  described  in  Section  4 of  Ref.l  and  in  more  detail  in  Ref .2.  These  references  both 
describe  a system  using  a different  make  of  tape  typevrriter,  but  only  minor  changes  had  been  made  by 
1975. 

8.  Reports  received  by  DRIC  were  indexed  for  subject  content  by  the  scientific  staff  vho  also  prepared 
abstracts.  After  technical  editing  and  the  addition  of  COSATI  subject  codes,  the  indexing  descriptors 
and  abstracts,  together  vdth  the  bibliographic  information  (but  not  the  COSATI  codes),  vrere  punched  on 
paper  tape  using  Datamatic  tape  typevnriters . Details  of  Corporate  Authors  vrere  inserted  at  this  stage 
from  edge-punched  cards  (vdth  an  8-hole  paper  tape  format)  selected  manually.  Tvre  typed  outputs  vrere 
produced.  The  first,  on  a so-called  'flimsy',  gave  a complete  listing  of  vhat  vias  punched  and  was  placed 
in  an  envelope  vd.th  the  corresponding  length  of  paper  tape.  The  seco^  output,  a card  printed  through  a 
carbon  patch  on  the  flimsy  and  containing  only  the  bibliographic  details,  vias  used  by  the  appropriate 
report-handling  section  of  DRIC  as  a document  'movements'  card  on  vhich  the  number  of  copies  held,  any 
initial  distribution,  any  loans  and  returns  vrere  recorded. 
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2.  Ridler,  E.H.  et  al  Production  of  RM)  Abstracts  and  printed  indexes  to  RM)  Abstracts  using  tape 
typevnriters 

TIL  Report  21,  Aug.  1969 


9.  Qeiks  then  proof-read  the  'fliasy*  against  the  original  dooaent,  checking  for  typing  errors,  and 
narked  axQr  errors  in  red.  Ihe  envelopes  (containing  the  annotated  fliasies  and  corresponding  tmes) 

Mere  then  sorted  nanuallv  by  the  OOSATI  codes,  this  being  the  order  chosen  for  presentation  Li  ntA.  The 
ordered  envelopes  Mere  then  returned  to  the  tape  typists  vho  used  the  pundied  tapes  to  operate  the  type- 
writers autonatically  thus  producing  a second  t^,  stopping  as  necessary  to  nake  any  corrections.  Ihe 
output  fron  this  stage  was  a nunber  of  p^r  tq>es,  each  containing  corrected  details  of  some  25  reports, 
and  a typescr^t  in  ntA  fomat.  Ihe  latter,  vdiich  was  larger  than  the  A4  size  of  ntA,  was  sent  to  UlIC's 
Reprographic  Section  for  the  production  of  ntA  copies  by  offset  litho.  The  Contractors'  Edition  of  ntA 
was  later  coopiled  by  a cut-and-paste  nethod  fron  tte  nasters  used  for  the  MX)  Edition,  unsuitable  itens 
being  renoved. 

10.  After  a check  for  parity  errors,  the  long  lengths  of  tape  were  sent  to  an  ICL  1900  co^uter  at  an 
MX)  bureau  for  production  of  the  inderas  to  the  MX)  edition  of  ntA  and  for  the  SDI  service.  The  data 
they  contained  were  also  added  to  a data  base  held  on  —g^)r  tqpes  at  the  bureau  and  covering  itens 
included  in  ntA  fron  1970.  Because  the  Contractors'  Edition  mbs  prepared  by  a cut-and-paste  nethod,  no 
indexes  could  be  provided  for  it  by  the  co^xiter. 

11.  It  was  felt  that  this  systen  siade  too  amqr  dennds  on  the  clerical  and  typing  effort  and  that  it 
could  be  siiqilified  by  using  a mini-cxaqiuter,  with  an  ipper  and  lower  case  line  printer.  Moreover,  the 
■ini  would  have  other  advantages:  it  would  nore  readily  allow  suitable  itens  to  be  printed  in  WA  under 
■ore  than  one  OOSATI  subject  heading;  it  could  produce  indexes  vhich  were  nore  legible  than  the  tqpper 
case  indexes  prepared  on  the  1900  coi^ter;  and  the  nasters  for  both  the  MX)  and  the  Contractors' 

Editions  could  be  prepared  directly  from  the  cooputer,  allowing  the  latter  Edition  to  have  indexes  for 
the  first  tine,  ^id,  since  the  vnrk  vould  be  dm  at  ERIC,  and  the  cooputer  would  be  under  our  own 
control,  turn-round  would  be  quicker  and  it  would  be  easier  to  control  the  quality  of  the  print-out. 

The  cooputer  would  also  have  other  applications  in  DRIC  (discussed  in  paragra{dis  18-25  below) . A 
detailed  feasibility  study  concluded  that  the  installatim  of  a nini-cooputer  in  DRIC  would: 


(a)  increase  efficiency  in  the  preparation  of  data  for  IKA  and  indexes; 

(b)  increase  security  in  the  production  of  IRA  (by  reducing  the  amount  of  movement  of  classified 
infonnation) ; 

(c)  increase  security  in  the  movement  of  classified  doosnents  (see  paragraph  22  below) ; and 

(d)  provide  a generally  iaproved  service  to  all  ERIC  customers. 
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12.  The  Systran  finally  devised  for  the  production  of  DRA  is  shown  in  Fig. 5b.  The  first  3 stages  are  ^ 

^ ; essentially  the  same  as  for  the  previous  manual  system  (Fig. 5a),  including  the  production  of  the  nove- 

I ments  card,  the  main  difference  being  the  addition  of  data  fiel^  to  the  punched  tape  to  indicate  the 

CDSATI  heading(s)  and  the  edition  of  IKA.  However,  many  items  are  now  punched  in  random  order  onto  one 

tape  and  this  tape  is  read  strai^t  into  the  4080  coainiter.  Here  a ramfcer  of  validation  checks  are 

ai^lied,  such  as  checking  that  the  accession  number  is  of  the  correct  form,  i.e.  has  the  appropriate 

nimiber  of  letters  and  digits  correctly  arranged,  that  an  acceptable  security  classification  is  present,  ^ 

that  the  date  field  is  correctly  constituted  and  represents  an  acceptable  date,  and  that  the  number  of 

data  fields  is  correct. 

' 13.  The  next  stage  will  be  to  display  the  complete  details  on  a VDU  screen  together  with  an  indication 

of  any  errors  identified  during  the  validation  run,  as  shown  in  Fig.  6.  Errors  shown  in  this  figure  are 
a date  with  the  month  numbered  '15',  and  a descriptor  longer  than  %)  characters(a  '/'  having  been 
omitted).  The  errors  are  in  fact  shown  in  reverse  video  (v4iite  on  black)  to  make  them  stand  out  more 
I clearly,  but  this  could  not  be  shown  in  the  figure.  The  proof  checkers  will  work  directly  from  this 

I display  and  correct  the  errors  indicated  by  the  congxiter  and  any  others  they  notice  by  reference  to  the 

original  report,  such  as  the  mis-spelling  of  'gusts'  and  'stratosidiere'  in  Fig. 6,  using  the  keyboard 
editing  controls.  To  save  time  only  a sample  check  of  abstracts  (as  opposed  to  bibliographic  details 
and  indexing  descriptors)  is  made,  a test  having  shown  that  few  significant  errors  arise  there.  The 
second  punching  of  paper  tape  is  rendered  unnecessary  by  this  use  of  the  VDUs.  We  intend  to  investigate  1 

the  possibility  of  direct  keying  to  the  conputer  at  a later  date,  vdiich  would  simplify  the  operation 
still  further  and  allow  validation  checks  to  be  made  at  the  data  preparation  stage.  Other  possibilities 
are  key  to  magnetic  tape  or  disc. 
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Raeant  RAE  rasaareh  onCguaats^haa  baan  particularly  concarnad 
with  savara  ousts  and  the  situatlona  In  which  they  occur.  In 
thafaratospher^  mountain  wave  conditions  and  those  In  tha 
vicinity  of  thunderstorm  tops  have  baan  Invastlgatad.  At  lower 
altitudes,  gusts  In  and  near  thunderstorms  have  also  baan 
studied, as  have  wind  and  gust  effects  likely  to  be  significant 
during  taka-off  and  landing.  This  work  has  relevanea  both  to 
aircraft  operations  and  to  aircraft  design.  In  tha  latter 
connection,  recant  work  on  mathematical  models  of  severe  gusts 
Is  also  described.  Mention  Is  made  Is  made  of  tha  affects  of 
pilot  control  activity  during  flight  through  gusts.  PBP? 


Fig. 6.  Exanple  of  the  Display  on  a VDU  for  Proof-checking 


(The  question  marks  identify  the  ends  of  data  fields,  some 
of  vdiich,  eg  contract  number  and  similar  details,  are  not 
applicable  to  this  example  and  have  been  left  blank) 


14.  The  corrected  items  for  a vdwle  period  (half  a month)  are  sorted  by  the  conputer  into  CDSATI  subject 
order  and  according  to  the  appropriate  editions  of  ERA,  being  duplicated  by  the  conputer  if  they  {ppear 
in  both;  and  cross-reference  entries  without  an  abstract  are  generated  for  subsidiary  (DSATI  headings. 
This  sorting  of  about  400  envelopes  used  to  take  a considerable  time  vhen  performed  manually.  The 
isasters  for  printing  both  editions  of  ORA,  including  the  indexes,  are  produced  on  the  line  printer.  The 
opportunity  was  taken  to  inprove  the  lay-out  of  the  indexes  and  exanples  of  the  old  and  new  subject 
Intexes  are  shown  in  Fig. 7.  A magnetic  tape  is  also  prepared  by  the  mini-conputer  for  use  in  building 
up  the  data  base  and  naming  the  SOI  service  cm  the  ICX  1900  at  the  MX)  Bureau. 
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Fig. 7a 

Old  Form  of  Subject  Index  to  DRA  (prepared  by  bureau  conputer) 
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Fig. 7b  Present  Form  of  Subject  Index  to  DRA  (prepared  by  in-house 

mini-computer) 

15.  A list  of  corporate  author  entries  is  to  be  held  on  the  conyniter  and  appropriate  entries  will  be 
added  autonatically  to  the  bibliogra{diic  details,  the  appropriate  reference  nunj>er  being  punched  at  the 
initial  punching  atage  instead  of  the  full  details.  T)k  errors  inherent  in  maintaining  large  nmbers  of 
duplicate  edge-punched  cards  will  then  be  eliminated.  Because  the  original  6-digit  mmhering  system  for 
corporate  au^rs  had  proved  inadequate  to  cope  with  organisations  having  large  numbers  of  subdivisions, 
such  as  the  UK  Ministry  of  Defence,  it  was  decided  to  revise  the  list  accordi^  to  standard  cataloguing 
rules  (with  minor  local  variations)  and  assign  new  nunbers,  incorporating  check  digits,  to  each  entry. 

ERIC  h^  insufficient  effort  available  to  do  this  in-house  and  so  a contract  was  placed  for  it  to  be 
done  externally. 

Thesaurus  Oiecking 

16.  The  next  stage  will  be  to  store  the  ccnplete  Thesaurus  on  the  conputer.  DRIC  uses  the  Thesaurus 
of  Engineering  and  Scientific  Terms  (TEST)  (Ref .3)with  local  additions  to  meet  the  danands  of  recent 
technology  and  to  cope  with  identifiers  such  as  MICA  and  Chieftain.  Storing  the  Thesaurus  on  the  conputer 
will  have  two  main  benefits.  Firstly,  it  will  enable  the  subject  indexers  and  technical  editors  to  make 
reference  to  an  up-to-date  list  of  added  terms,  using  VDUs.  Seccjndly,  it  will  enable  the  conputer  to 
check  the  accept^ility  of  the  descriptors  and  the  accuracy  of  punching  at  the  validation  stage,  any 
errors  being  displayed  on  the  VDU.  The  proof-checkers  can  correct  wrongly-spelled  or  mis-punch^ 
descriptors,  but  any  vMch  are  not  found  in  the  Thesaurus  need  to  be  listed  for  a senior  indexer  to  check. 
Accept^le  new  descriptors  will  then  be  added  to  the  Thesaurus  for  future  use. 


3.  Engineers  Joint  Council  in  co-cperation 
with  the  Department  of  Defense,  USA. 


Thesaurus  of  Engineering  and  Scientific  Terms 
First  Edition,  December  1967 
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17.  me  holds  a copy  of  the  original  version  of  TEST  on  tape  at  the  bureau  conputer.  This  has  been 
used  to  check  past  indexing  and  produce  a list  of  descriptors  used  which  are  not  in  TEST.  A contract  I 

was  recently  placed  to  correct  spell^  and  punching  errors  and  to  list  the  others  so  that  they  could  be  ] 

considered  by  the  indexers  as  potential  additions  to  the  Thesaurus.  The  ig>-dated  Thesaurus  will  shortly  ^ 

be  stored  on  the  mini-can|Mter,  for  use  as  described  above.  On  the  bureau  conputer,  the  indexing  terns  I 

used  in  the  past  will  be  corrected  so  that  retrospective  searching  can  be  carried  out  using  this 

Thesaurus  without  the  user  also  needing  recourse  to  lists  of  other  terns  used.  | 

OTIBR  USES  OF  TIC  MINI-aMVrER  ] 


18.  Another  current  application  for  the  in-house  coiqiuter  is  the  production  of  statistics  on  the  exchange 
of  reports  with  foreij^  countries,  mc  has  often  been  asked,  e.g.  by  members  of  MOD  HQ  concerned  with 
international  cooperation,  to  provide  statistics  on  the  interchange  of  reports  with  foreign  countries^  and 
these  requests  have  required  lengthy  manual  counts  in  the  past.  Moreover,  to  siiplify  the  actual  prepara- 
tion of  statistics,  the  clerical  staff  listing  reports  sent  to  or  received  from  id>road  have  used  different 
forms  for  each  major  country.  This,  of  course,  created  problems  idien  more  than  one  person  needed  a 
particular  form,  and  time  was  lost  in  changing  from  one  to  another.  Using  the  mini-con|)uter  to  prepare 
statistics  has  allowed  any  mxber  of  different  countries  to  be  entered  in  random  order  on  a single  form. 
Thus  each  clerk  now  uses  only  one  form  no  matter  which  countries  he  is  dealing  with,  and  no  one  else 
needs  to  use  it.  Minor  chants  had  to  be  made  to  the  format  to  make  the  forms  suit^le  for  ADP  work, 
but  these  changes  were  introdiced  to  the  staff  concerned  in  a short  talk  vMch  concentrated  on  the 
benefits  to  them  individually  and  to  mC  generally  and  also  eo^diasised  the  need  for  care  in  completing 
ADP  forms.  The  change-over  has  gone  reasonably  smoothly  and  tte  forms  are  generally  being  prepared  very 
carefully.  So  much  so,  that  vdien  trying  to  denonstrate  the  iiqiut  validation  program,  three  months  after 
the  change,  we  were  unable  to  find  a recent  t^  with  errors  had  to  create  some  artificially!  This 
is  a tribute  to  the  accuracy  both  of  the  clerical  staff  completing  the  forms  and  of  the  tape  typists 
punching  from  them.  There  are  occasional  problems  with  unclear  haixl-written  details  of  destinations 
(organisations  not  countries)  but  these  are  less  iiportant  than  the  other  details  recorded  since  they 
are  merely  used  as  records  and  not  as  elements  in  the  statistics. 
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19.  Standard  statistics  are  now  produced  each  quarter  showing  the  ramiier  of  classified  and  unclassified 
reports  sent  to  and  received  from  each  country.  More  specific  breakdowns  by  individual  months  and 
specific  countries  can  be  obtained  as  required,  and  these  listings  include  the  DRIC  accession  numbers  and 
the  precise  classification  of  each  report  to  enable  us  to  determine  in  more  detail  the  nature  of  the 
trade  with  the  countries  concerned.  An  exaiqile  of  the  quarterly  statistics  is  shown  in  Fig. 8.  Fictitious 
names  have  been  used  to  avoid  disclosing  possibly  sensitive  information. 
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20.  At  a later  date  we  intend  to  make  the  output  frooi  the  specific  listings  available  on  aagnetic  tape 
so  that  the  titles  or  the  cooplete  bibliogr^ihical  details  and  abstracts  can  be  extracted  frm  the  oain 
data  base  stored  at  the  ICL  1900  cooputer  bureau.  This  will  make  identification  of  the  subject  areas 
luch  easier.  This  is  not  as  straightforward  as  it  mi^t  seem  since  the  data  on  the  1900  is  stored  not 
by  accession  mnber  but  ^ location  reference  ninter,  a serial  nunber  indicating  the  position  and  the 
issue  of  DRA  in  vdiich  the  report  was  announced,  and  the  two  ntnbers  are  totally  unrelated. 

21.  Eventually,  we  hope  to  provide  VDUs  for  the  input  of  the  reports  exchange  data,  with  on-line  valida- 
tion. But  this  must  await  a feasibility  study  to  determine  both  its  practicality  and  whether  it  is  cost- 
effective. 

Security  suite 

22.  Another  a]plication  for  the  ndni-cooputer  is  a system  for  recording  the  movement  of  classified 
reports.  As  mentioned  in  paragra^rfi  12  above,  the  present  system  provides  a 'movements'  card  on  udiich 
details  of  initial  distribution  and  subsequent  requests  are  recorded.  The  movements  cards  are  held 
in  accession  nunber  order,  there  being  only  one  for  each  report.  The  only  other  way  to  obtain  report 
details  is  from  the  corporate  author  (originating  body),  another  card  being  filed  in  this  order  for  each 
report  and  acting  as  a cross-reference  to  the  movements  card  \diich  contains  the  full  details.  However, 
if  docunent  movements  were  entered  on  the  conputer,  any  desired  listing  could  be  obtained,  and  it  is  our 
intention  to  do  so  if  a feasibility  stufy  shows  that  it  is  practicable.  This  system  will  te  in  batch 
mode  initially,  but  it  is  possible  that  on-line  working  could  be  developed  for  this  operation  also. 
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Field  of  Interest  Register  ^ I 

23.  When  requests  for  reports  or  searches  are  received  from  organisations  outside  the  Ministry  of  Defence,  ,j 

MUC  has  to  check  that  they  have  an  appropriate  'need  to  know',  ie  that  they  are  carry^  ^t  work  for  ^ 

Ministry  in  the  subject  area  concerned.  To  this  end,  it  is  intended  to  prepare  and  maintain  on  the  mini- 

conputer  a 'field  of  interest'  register  listing  Defence  contractors  and  the  areas  of  interest  for  vdiich  ; | 

they  are  accredited.  This  will  be  printed  in  aljAiabetic  order  at  regular  intervals,  but  it  will  alM  be  ] i 

possible  to  ask  for  individual  entries  to  be  displayed  on  the  VDUs.  Another  use  for  Ae  register  will_  '•  I 

be  to  maintain  a list  of  the  subject  interests  of  M)D  customers  to  help  IXUC  make  an  initial  distribution  j 

of  reports.  ij 

SDl  ^Lnd  retrospective  searching  i 

.j 

24.  Two  applications  dependent  ipon  considerable  expansion  of  the  conputer  are  SDI  and  retrospective  ,| 

searching.  The  ICL  1900  conputer  used  for  these  puiix)ses  is  likely  to  be  replaced  by  a 2900  'New  Range'  j 

nmchine  in  a few  years'  time.  Rather  than  re-program  for  that  machine,  we  intend  to  assess  the  possibi-  i 

lity  of  using  the  mini-conputer.  | 

' 25.  A monthly  SDI  should  be  straightforward  on  the  mini-conputer,  once  the  retrieval  program  has  been  j- 

written,  but  retrospective  retrieval  from  an  on-line  data  base  may  require  more  disc  storage  than  will  be  i 

I available  even  with  the  enhancements  planned  by  the  manufacturers.  To  overcome  this  difficulty  we 

; propose  to  examine  the  possibility  of  storing  only  the  inverted  files  on-line  and  using  a conputer- 

1 controlled  automated  microfiche  reader  to  hold  the  bibliograirfiical  details  and  the  abstracts.  Such  a 

j system  was  investigated  by  Project  INTREX  (ref.4)  and  has  been  used  by  the  European  hkiclear  Docuaenta-  i , 

' tion  Service  (5) . Coranercial  systems  are  now  available  or  under  development  (6) . If  we  do  use  such  a 

‘ system,  we  should  also  be  able  to  provide  an  on-line  service  for  a few  large  M®  establishments,  who 

j would  each  hold  a diplicate  set  of  fiche.  This  would  be  particularly  appealing  since  the  coommications 

! system  would  carry  only  search  statements  and  the  nunbers  of  the  docimients  matching  those  statements. 

Thus  no  additional  security  problons  would  arise. 
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4.  Overhage,  C.F.J.  & 
Reintjes,  J.F. 


Project  INTREX;  a general  -review 

Information  Storage  and  Retrieval,  Vol.lO  No. 5/6,  May/June  1974,  pp  157-188 
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5.  Vernifflb,  C.  Abstracts  on  microfiche  for  on-line  retrieval 

In  AG\RD  Conference  Proceedings  No. 136  - "New  Developments  in  Storage, 
Retrieval  and  Dissemination  of  Aerospace  Information",  October  1973 


6.  Duerden,  F.  A oonputer-controllable  ultrafiche  terminal 

In  AGARD  Conference  Proceedings  No.  136  - "New  Developments  in  Storage, 
Retrieval  and  Dissmnination  of  Aerospace  Information",  October  1973 
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PHOBLEMS  IN  PPLajCOTATlOW  OP  Tlffi  MINI-OCMVIER 


26.  As  was  only  to  be  expected,  a nuaber  of  problens  arose  during  the  early  days  of  using  the  nini- 
coaiMter,  although  none  of  them  was  unduly  serious.  Perhaps  the  most  inportant  - and  certainly  the  most 
time-consiming  - was  the  need  for  parallel  running.  It  had  been  decided  at  an  early  stage  that  parallel 
running  of  the  old  and  new  systems  for  producing  DRA  was  necessary  but  we  failed  to  realise  the  full 
iaplications  of  this  decision  until  the  work  was  well  advanced.  For  true  parallel  naming  it  would  have 
been  necessary  to  have  punched  the  data  twice  over,  but  this  would  have  required  considerable  extra  effort 
by  the  rap»»  typists  and  was  totally  inpracticable,  particularly  if  it  were  to  be  cantinued  tor  3 months 
as  seoned  likely  to  be  necessary.  The  work  could  not  have  been  carried  out  elseidiere  because  the 
typists  would  have  needed  special  training,  some  of  the  codes  used  by  DRIC  might  not  have  been  found  on 
standard  tape  typewriters,  and  it  would  probably  have  been  difficult  to  find  an  M®  installation  able  to 
carry  out  the  work  or  a bureau  allowed  to  handle  classified  material. 

27.  The  input  format  for  the  mini-coaputer  system  had  been  chosen  for  its  ease  of  use  on  the  mini  and 
differed  significantly  from  that  used  previously  on  the  ICL  1900  computer.  Item  separator  synimls  were 
placed  before  the  end  of  a line  instead  of  after  it,  and  two  fields  had  been  interchanged  to  mate  their 
order  more  logical.  Moreover,  two  essential  new  fields  had  been  added,  the  OOSATI  subject  headings  and 
the  edition  of  I»A;  and  the  mini-coaputer  code  for  end  of  record  (01,110  differed  from  that  of  the  ICL 
1900  (Newline). 

28.  To  avoid  the  necessity  for  double  punching,  it  was  decided  to  punch  the  data  as  for  the  adni- 
coaputer  and  convert  it  by  a special  program  into  the  fonaat  required  for  the  ICL  1900.  Although 
this  was  not  true  parallel  naming,  it  enabled  the  new  system  to  be  tested  in  its  fii^  form  since 

any  errors  in  the  special  program  would  affect  only  the  old  ^stem,  which  was  otherwise  unchanged.  This 
did  in  fact  prove  to  be  a satisfactory  coapromise.  The  special  program  h^  to  be  written  in  two  parts. 
Firstly,  there  was  an  automatic  tape  program  for  the  tape  typewriters,  vdiich  removed  the  new  fields  and 
converted  the  tape  into  a form  acceptable  to  the  ICL  1900  tape-reader.  An  additional  tape-punching  was, 
of  course,  required  but  this  could  be  done  automatically  witteut  curator  action,  apart  frOT  changing 
tape  spools.  Secondly  a conputer  program  was  written  for  the  ICL  1900  to  edit  the  input  into  a form 
suitable  for  the  existing  index  production  programs.  For  this  we  used  the  extremely  powerful  ICL  George 
3 editing  system.  In  theory,  we  could  have  amended  the  index  production  program  to  accept  mo^fied 
input,  but  that  program  had  been  written  under  contract  8 years  before  and  had  had  many  modifications 
since.  Not  smrprisingly,  we  were  very  reluctant  to  touch  it,  especially  as  it  had  caused  problems 
earlier  in  the  year  which  had  taken  several  months  to  cure. 

29.  All  this,  of  course,  took  a considerable  amount  of  extra  effort  vdiich  had  not  been  planned  for, 
and  telped  to  delay  live  running  by  one  month  beyond  the  planned  starting  date  of  Janimty  1976.  This 
caused  another  problem.  The  first  two  issues  of  ERA  were  prepared  on  the  tape  typewriter/ia  1900 
system  and  the  remaining  four  issues  for  the  first  quarter  of  1976  were  prepared  by  the  mini -conputer. 

The  indexes  for  each  issue  were  prepared  by  the  corresponding  conputers,  but  Ae  quarterly  cuaulated 
indexes  could,  of  course,  be  provid^  only  by  one  or  the  other  system,  not  jointly.  To  have  produced 
the  first  quarter's  accunulation  on  the  1900  would  have  been  possible  but  that  would  just  have  deferred 
the  problem  until  the  annual  acamHation  vas  needed,  because  that  also  had  to  be  produced  by  one 
conputer  and  not  by  both. 

30.  So  it  was  decided  that  the  first  quarterly  accumulation  must  be  prepared  by  the  mini-conputer. 

At  first,  it  seemed  necessary  only  to  re-punch  the  first  two  issues  of  1976  some  t^  during  the 
quarter  in  mini-conputer  format  and  then  process  them  in  the  mini-conputer  to  provide  the  necessary  data 
for  the  accunulation.  However,  it  was  realised  (fortunately)  that  any  errors  in  the  manual  sorting  for 
the  1900  (or  in  the  punching  of  the  OOSATI  headings  for  the  mini-conputer),  if  not  detected  and  correct^, 
would  result  in  reports  teing  given  different  location  reference  numbers  (see  paragraib  20)  by  the  mini- 
conputer  from  ^se  they  had  been  assigned  manually.  This  would  have  rendered  the  indexes  valueless  and 
was  considered  too  great  a risk  to  be  justifiable  and  so  another  method  was  sou^t.  Eventually  two 
programs  had  to  be  written,  one  for  the  1900  to  list  on  magnetic  tape  the  location  reference  nunbers  used 
together  with  the  corresponding  DRIC  accession  numbers,  and  one  for  the  mini-conputer  to  accept  this  tape 
and  change  the  location  reference  numbers  which  it  had  generated  to  the  values  which  had  actually  been 
used.  This,  too,  took  time  which  had  not  been  planned  for. 

31.  Could  these  problems  have  been  avoided?  Yes,  if  we  had  taken  a chance  and  decided  not  to  bother 
with  parallel  running  (a  gamble  vbich  would,  in  fact,  have  come  off,  but  vbich  it  would  have  been  very 
risl^  to  take)  or  if  we  had  prepared  two  separate  iiputs;  and  they  could  have  been  eased  to  a consider- 
able extent  if  we  had  made  fewer  changes  in  the  format.  Perhaps  there  is  a lesson  to  be  learned  that_ 
changes  should  not  be  made  when  designing  a new  system  unless  their  full  inpact  on  the  old  system  during 
parallel  running  has  been  assessed  and  acc^ted. 

32.  Other  difficulties  have  occurred  in  the  early  stages  of  running  the  mini-conputer  programs  but  they 
were  mainly  caused  by  infrequent  combinations  of  conditions  which  had  not  occmred  in  the  test  data,  and 
the  only  lesson  that  can  be  learned  from  this  is  the  old  one  that  testing  must  be  as  exhaustive  as 
possible,  but  that  bugs  will  still  lurk  unseen  until  full  production  running  begins  fo  earnest.  Some 
may  never  rear  their  ugly  heads  - but  the  probability  of  tJiat  is  low  in  most  comnercial  computing! 
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33.  In  conclusion,  I think  it  is  clear  that  a snail  comiuter  such  as  ours  can  be  used  to  perform  many 
tasks  of  a quite  sophisticated  nature  for  a relatively  low  cost  - a fraction  of  the  price  of  a main- 
frame coo|xiter. 

CritiER  USE  OF  OOUVrERS  IN  IHE  BRITISH  MINISTRY  OF  MFENCE  INFORMATION  AND  LIBRARY  SERVICES 

34.  Although  not  strictly  relevant  to  the  title  of  this  paper  or  the  subject  matter  of  this  session,  it 
was  thought  that  a very  brief  sunnary  of  other  uses  of  con|xiters  in  information  and  library  work  in  the 
Ministry  of  Defence  would  be  of  interest. 

OUC 

35.  In  addition  to  the  GEC  4080  coafxiter,  ERIC  still  makes  use  of  the  ICL  1900  ccoyxiter  at  a Ministry  of 
Defence  Bureau  in  London.  This  is  used  for  two  purposes:  Selective  Dissemination  of  Information  (SDI); 
and  retrospective  searching.  Bibliographic  data  and  abstracts  of  reports  that  have  been  processed  on 
the  4080  con|niter  are  sent  on  magnetic  tapes  to  the  Bureau  twice  a month.  An  SDI  service  is  provided 

on  these  data  every  month  for  members  of  MM)  Establishments  and  Headquarters  Branches.  Ihese  data  have 
been  accunulating  since  1970  and  will  shortly  be  used  for  on-line  interactive  retrospective  searching 
by  ERIC  staff  in  response  to  requests  from  other  mend>ers  of  MM)  and  Defence  Contractors.  ERIC  also 
supplies  magnetic  tapes  monthly  to  certain  of  the  larger  R4D  establishments  to  ensible  them  to  run  their 
own  SDI  services  in-house. 

Atomic  Weapons  Research  Establishment.  Aldermaston  and  Foulness  (AMRE) 

36.  AWRE  at  Aldenmston  uses  an  IBM  370  conputer  for  a ntmber  of  Information  and  library  purposes.  One 
is  an  SDI  service  based  on  published  data  bases  (US  Government  Reports  Announcements,  Chemical  Abstracts 
Condensates,  INSPEC,  Nuclear  Science  Abstracts,  and  MEEAIffiX).  The  SDI  service  was  initially  provided  for 
members  of  AlffiE  staH,  but  has  now  been  extendi  to  many  R&D  establishments  in  the  Ministry  of  Defence, 
under  the  aegis  of  DRIC. 

37.  The  Aldermaston  Mechanized  Cataloguing  and  Ordering  System  (AMCOS)  (ref. 7)  is  based  on  MARC  (Machine- 
Readable  Cataloguing)  records  produced  by  the  British  Library.  It  has  two  main  functions:  firstly  a 
selection  of  references  relevant  to  AWRE  is  extracted  from  the  MARC  tapes  using  the  Dewey  Decimal 
Qassification  field.  This  list  forms  a Potential  Requirements  File  from  which  the  Library  staff  select 
books  for  purchase,  and  from  which  required  data  are  extracted  to  generate  the  orders.  Books  are  of 
coin^e  selected  from  other  sources  as  well.  Secondly,  the  MARC  records  are  used  for  cataloguing  purposes, 
with  the  addition  of  local  information  such  as  shelf  niaber,  and  accessions  lists  and  catalogues  are 
prepared  by  the  ccnputer.  The  catalogue  is  cumilated  quarterly  by  printing  out  in  CON  format  on  micro- 
fiche. 

38.  The  Aldermaston  Library  also  runs  a semi -mechanized  loans  system,  using  punched  cards,  and  produces 
a UDC/Aljdiabetical  Subject  Index  List  for  cross-reference  purposes. 

39.  The  AWRE  outpost  at  Foulress  has  developed  an  on-line  loans  control  system  vhich  has  been  described 
in  ref.  8. 

Defence  Operational  Analysis  Establishment.  West  Byfleet  (DQAE) 

40.  DOAE  has  an  on-line  retrospective  informatim  retrieval  service  based  on  the  reports  held  by  the 
library.  The  software  for  this  was  developed  as  an  extension  to  the  ICL  1900  con|>uter  operating  system, 
GEORGE  3,  the  DQAE  conputer  staff.  An  SDI  service  from  the  reports  accessions  is  also  run  for  members 
of  DOVE  staff  and  at  least  one  external  establishment.  A Library  accessions  list  is  prepared  by  the 
coiputer.  Records  of  technical  files  are  also  held  on  the  conputer,  lists  being  printed  out  in  KNX 
format. 

Royal  Aircraft  Establishment.  Famborough  (RAE) 

41.  RAE  have  rhree  operational  Library  conputer  systems:  a UDC/Subject  Index  Listing,  a Register  of 
Interest  of  their  scientists,  and  an  Annual  Index  to  RAE's  Technical  Reports. 

National  Gas  Turbine  Establishment.  Pyestock  fNGTE) 

42.  NGTE  uses  the  RAE  Technical  Reports  program  to  produce  author  and  KWDC  accession  lists  of  books  and 
reports. 


7.  Corbett,  L.  ft  /JCOS Project  stage  2:  a conputer-aided  integrated  system  using  BNB  MARC 
German,  J.  literature  tws. 

Program,  Vol  6,  No.  1,  January  1972,  pp  1-33 
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43.  RSRE  have  cci|)uterized  their  Periodicals  List  and  also  prepare  a XHDC  index  to  their  Newsletter 
also  using  the  RAE  program.  Ihe  KNOC  index  will  shortly  be  extoided  to  include  report  material. 


Anmments  Research  and  Devel 


t Establishment.  Fort  Halstead 


44.  RAREE  nm  an  SDI  service  based  on  the  reports  received  by  their  Library. 

Other  Establishments 

45.  Other  establishments  are  investigating  the  possibility  of  using  the  conputer  for  on-line  retro- 
spective retrieval  or  for  loans  control,  iuid  several,  including  some  listed  above,  are  planning  in- 
house  Selective  Dissemination  of  Information  services  using  tapes  siq>plied  by  ERIC. 


I should  lilce  to  adcnowledge  with  mich  gratitude  the  help  given  in  the  preparation  of  this  paper  by  my 
colleagues  in  ERIC.  Any  errors  are,  of  course,  mf  own  responsibility.  Any  views  expressed  are  own 
and  do  not  necessarily 'represent  those  of  the  Ministry  of  Defence  or  FM  Govemnent. 
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The  growing  need  for  information  services  poses  increasing  problems  for  libraries  and  documentation 
centers.  Data  processing  techniques  provide  several  possibilities  for  improvement;  among  the  most  recent  techniques  are 
the  use  of  minicomputers. 

A description  of  the  mode  of  operation  In  library  loan  posting  is  given,  as  well  as  a discuuion  of  the 
extent  and  structure  of  ^ta,  linkage  possibilities,  and  special  operational  features. 

For  these  specialized  applicatioro  a short  survey  of  the  hardware  configuration  and  software  of  minicom- 
puters is  also  presented.  Comparison  is  made  betvreen  a stand  alone  system  and  a minicomputer  connected  to  a background 
computer.  This  study  is  based  on  actual  projects  existing  In  the  Federal  Republic  of  Gemrany. 


1 .  BASIC  PURPOSE 

1.1. 

When  libraries  start  with  EDP,  very  often  they  start  with  circulation  control . This  is  the  situation  In 
Germany  in  any  case  with  Its  traditionally  structured  libraries. 

The  reasons  are: 

1 . Increase  in  quantity  of  the  borrowers. 

2.  Users  demand  on  the  library. 

3.  Shortage  of  library  staff  arid  Increasing  costs. 

4.  Inefficietscy  of  monual  operation. 

The  basic  for  an  effective  application  of  computers  in  the  library  is  given  by  a large  quantity  of  data  and 
a repeated  access  to  identical  data.  This  takes  place  In  any  larger  library. 

1.2. 

The  purpose  of  circulation  control  is  quite  simple  - to  assure  that  books  or  other  materials  the  library  hold, 
are  available  to  users  who  need  them,  within  a reasonable  period  of  time  and  for  a reasonable  period  of  time.  The  data  of 
circulation  control  should  be  gotten  as  fast  os  possible  and  in  a cost-effective  way. 


1.3. 

There  are  three  different  types  of  data: 

1 . Infonnations  concerning  circulation  data  (Dote,  loan  desk  . . .) 

2.  Infonnations  concerning  the  user  (Name,  address,  status  . . .) 

3.  Infonnations  concerning  the  book  (Book  identificotion,  shelf  mark,  reminder,  classified  docunrents  . . .) 

Libraries  of  course  differ  greatly  in  the  importance  they  attach  to  orw  or  another  of  these  functioru, 
reflecting  differences  in  the  purpose  of  the  libraries  themselves.  Seme  have  the  requirement  of  very  rapid  accessibility, 
others  have  a lot  of  special  collections  with  different  kinds  of  using  them. 


2.  LIBRARY  CIRCULATION  AND  CONTROL  IN  WESTERN  GERMANY 

Most  of  the  research  libraries  in  the  Federal  Republic  have  hod  no  direct  connection  to  computer  centers 
until  quite  recently,  therefore  they  first  developed  off-line  systems.  This  procedure  has  some  disadvantages,  book  reserva- 
tions are  difficult,  blocked  users  cannot  be  stopped  at  the  loan  desk  and  there  is  alwoys  a gap  of  information  between 
two  proceuing  steps  of  the  computer. 

An  on-line  system  is  able  to  give  this  information  at  orKe  by  direct  access.  The  trend  in  the  lost  few 
years  In  library  circulation  and  control  In  German  research  libraries  has  tended  towards  the  use  of  on-line  systems.  At 
present  the  following  on-line  circulation  systems  are  working:  Aochen,  Bielefeld,  Bremen,  Dortmund,  Dusseldorf,  Kiel, 
Munster. 

There  have  been  two  alternative  pouibilities  for  on-line  data  processing,  to  be  a participant  in  a central 
computer  or  to  use  stand  alorte  computer  In  the  library.  A new  situation  is  created  by  the  development  of  package  mini- 
computers to  support  local  library  furtetiom,  especially  In  circulation  control. 


3.  HARDWARE  OF  MINICOMPUTERS 


3.J . 

Th*  mau  producHon  of  tntogratod  circuits  hostwMd  th*  dovelopmant  of  mlnicompuiors.  Today  th* 

•fficlaney  of  mlnlcompotars  Is  comfxiroble  In  some  fialdsof  oppllcotion  to  that  of  lorB* 

Minicomputors  nomially  woik  with  a word  length  betwoan  8 and  32  bits.  Th*  dwrtor  the  word 
lamer  Is  the  mejoiiement  of  moin  storage  for  retention  or  doto  and  programs.  Usually  most  of  the  syst^  hove  word 
lergths  of  16  bits.  Some  of  these  are  odressoble  by  position.  The  storage  is  generally  o combination  of  magnetic  core 
storage  and  semiconductor  storage  thus  building  o synthesis  of  economy  ond  quick  acceu.  , . , , 

Most  of  the  minicomputers  hove  a maximum  memory  sire  of  32  K words,  several  peripheral  equipment  con 

bft  oonrMcted  to  the  unit. 

3 2 

In  oddition  to  the  memory  size  the  program  structure  it  alto  important  for  the  applications  In  cirauittfion 
control . The  theoretical  estimation  of  the  program  require,nent  for  circulotlon  control  is  not  eosy;  os  a rale  it  is  safe  to 
assume  that  most  of  the  program  instructions  are  logical  operations  (compare,  move,  displace  etc.). 

3 3 

Another  factor,  which  effects  the  storage  capacity  it  the  size  of  operating  syttem.  This  amount  is  lost 
to  the  user.  Normally  the  operatirrg  systems  need  about  4 K, 

3 4 

In  addition  to  the  system  Inherent  capocity  standards  the  ute  of  the  storage  oapocity  ft  tt^ly  dependent 
on  the  ways  and  frequency  of  library  monoglng.  It  mdees  o difference  whether  the  computer  is  a stand  alone  system  or  only 
partly  so,  whether  one  or  various  terminals  are  connected,  whether  only  check  in  ond  chedc  out  procettet  ore  to  ^ed 
^ or  whether  there  is  o requirement  for  constont  queries  about  users  or  books.  Moreow  there  ore 
efficiency  of  a minicomputer  system,  su<h  os  kinds  of  input,  disc  uniH  or  doto  protection-,  but  > •«  i* 

theoretical  survey,  ond  show  you  instead  two  examples  of  how  minicomputers  work  in  research  1 lories  of  the  Fe^ral 
Republic  of  Germany.  They  are  the  circulation  system  of  the  University  Library  of  MUnster  and  the  plans  for  the  foture  of 
the  circulation  system  in  Hsssian  libraries. 


4.  THE  UNIVERSITY  LIBRARY  OF  mONSTER,  A STAND  ALONE  SYSTEM 


The  University  Library  of  Munster  Is  a traditional  closed  stack  library  with  more  then  1 .2  Ml(^  volumes. 
The  clreulotlr^  moterials  are  In  closed  stacks.  There  are  max.  3600  orders  for  loans  per  day,  on  ^rage  2100^tual 
loans  max.  2000,  on  avertge  1300,  and  os  many  returns.  The  loan  dedr  are  opened  7 .5  hours.  The  syttem  of  book  ooll 

numbers  Is  difficult  and  heterogeneous.  . , 

Shortage  of  personnel  made  a computerized  circulation  system  indispensable.  Financial  consideratlom  ^ 

and  the  fact  that  the  university  computer  was  oireody  completely  utilized  by  others  led  to  the  ® * , 

alone  computer,  type  IBM  System/3  Model  10.  It  consists  of  o central  processing  unit  with  a 32  K Byte  a c^ie 
typewriter,  o line  printer,  o cord  unit  with  vorlous  functions,  three  mwll  discs  with  2.47  MIo  bytes  each  and  o disc 

storage  with  20.48  Mio.  bytes.  ..  oi  rvm  « 

The  purchose  price  for  the  sysfem/3  wos  about  310  000  S mointenance  choiges  ore  ofcout  21  000  S. 


With  a grant  from  the  Geimon  Research  Society  the  library  embarked  on  a pilot  project  to  develop  o low 
cost,  stand  olone  circulation  system,  which  should  by  suitoWe  for  use  in  other  libraries  with  closed  stotks. 

The  systenw  was  designed  for  the  following  functions: 

a)  Instant  loan  from  closed  stacks. 

b)  Complete  ond  quick  information  on  books  loaned. 

c)  Automized  cheek  in  and  cheek  out,  reserving  books,  renewals  ond  reminders. 

d)  Avoidance  of  going  into  the  closed  stocks  by  direct  pauing  of  orders  with  only  positive  response. 

e)  Perseverance  of  the  call  number  system  (without  changes),  and  no  additional  book  equipment. 


Eoch  loan  Is  proceeded  by  a request.  User  code  and  book  nwnber  are  entered.  The  user's  identifiootlon 
cord  (os  well  as  the  return  entry  card)  is  o 96-column  punched  card.  The  book  numbers  are  ^ optiwl  "wjjt 

enquiry  cords.  The  cord  contains  a number  of  columns  with  10  positions  each.  The  user  intern  the  1^  lusnber  in  these 
columnt  in  the  form  of  pencil  marks.  The  multi-purpose  card  unit  con  rood  the  punched  cards  cs  well  os  the  optical 


moih  g request  slip  Is  turned  In,  the  information  on  the  book  - request  granted,  or  otherwl^  the  where- 

obouts  of  the  book  - is  returned  in  less  than  30  seconds.  The  recording  procedure  itself  tekes  1 .4  seconds.  Preporing  o 
book  for  loan  requires  a half  hour  on  the  average.  The  reasons  for  this  relatively  long  time  ore  not  to  be  found  In  the  area 
of  doto  procenir^,  but  rather  in  the  poor  personal  situation  in  the  stocks. 


Th«  low  dtK  oopoclty  mokM  It  iwcMMry  to  Hto  book  flU  m a iwaotivo  flU  (tromoctlon  locord 

of  loonod  books).  Tho  book  fllo,  obbroviolod  uwr  fllo,  and  tho  umt  oddimi  flU  «•  indviod.  A good  dwl  of  offort  hod 
bowl  ipwit  toardkii«  through  tho  roquotH  to  sort  out  thoN  books  which  roquiro  qwclol  trootmont  or  which  oro  not  kopt 
In  tho  closod  stacks.  This  Information  Is  contalnod  In  auxiliary  fllos  on  disc  steros. 

On  tho  wholo,  tho  Informotlon  lor«th  Is  kopt  vory  briof.  In  part  ovon  os  bit  Infbrmotlon.  Tho  vrholo  soft- 
waro,  with  tho  oxcoptlon  of  tho  programs  portolnlng  to  tho  oporoting  sysiom  hat  boon  nowly  writton  In  programming 
loi^uago  RPG  II  (Roport  Program  Gonorator  II).  Tho  dovolopmontol  photo  roquirod  fivo  mon-yoort. 


Tho  ovoluotlon  of  tho  procoduro  thews  tho  following  rotults: 

With  practically  no  Incroaso  In  porionnol,  tho  loon  quota  roio  by  70  %.  Tho  circulation  of  books  hot  boon  occolorotod. 
Tho  torvico  It  quickor  thon  boforo.  Tho  Infomwtlon  on  tho  whoroabouts  of  tho  books  Is  varied  ond  Infoonotlvo.  Tho  down 
timo  of  tho  mlnlcomputors  It  Insignificant  and  hardly  offbct  tho  loon  tytlom.  Anothw  odvontago  it  tho  floxiblo  provontivo 
malntononoo  tImo,  which  can  bo  orrongod  outsido  of  oporoting  hours,  and  tho  fact  that  tho  system  con  bo  put  to  use 
spocifloolly  according  to  tho  prevailing  oonditlont. 

It  It  to  our  dlsodvontago  fhot  ony  onlatgomont  of  tho  moin  memory  unit  and  tho  addition  of  further  terminals 
ora  noorly  Impossible,  to  say  tho  least  vory  costly.  Tho  connection  of  tovorol  Input  units  In  more  or  lost  distant  locations 
It  hardly  possible.  The  capacity  of  tho  printer  Is  alto  dlssotlsfactory  ot  6000  llrwt  per  hour. 


5.  CIRCULATION  CONTROL  IN  HESSIAN  LIBRARIES 


Tho  dovolopmont  of  the  use  of  EDP  in  tho  Hessian  librarios  took  o different  course.  Allow  me  to  mdee 
some  basic  observations  on  this  subject  in  advance.  In  1970,  fivo  roglonal  community  computer  centers  with  identical 
computer  systems  where  faursdod  In  Frankfurt  am  Main,  GieBen,  Darmstadt,  Kassel  attd  Wiesbaden.  They  are  responsible 
far  the  automation  of  different  functions  in  public  odministrotion.  The  public  libraries  and  research  libraries  are  both 
linked  to  this  network.  The  goal  is  to  ochleve  on  integrated  library  system  step  by  step.  For  this  reason,  all  processes 
are  being  developed  under  tho  aspoct  of  univorsal  opplicobiblity,  allowing  the  participation  of  all  types  of  libraries.  The 
processing  of  local  data  takes  place  in  each  regional  computer  center;  data  to  which  several  libraries  must  have  occess 
(bibliographical  data  os  a rule)  is  processed  in  central  files  in  the  regional  computer  center  in  Frankfurt.  Tho  data  process- 
ing network  operates  free  of  cost  for  the  state  ar»d  community  administrations.  The  state  of  Hessen  carries  the  subsidies. 

As  a result,  analysis,  programming,  and  processing  are  carried  out  by  tho  network  whereas  the  data  compilation  ond  irsput 
is  organized  ond  financed  by  the  individual  library.  Under  these  conditions  the  development  of  the  automation  of 
circulation  control  vras  begun  in  1971 . Included  in  tho  process  are  local  loons  and  active  and  passive  interlibrary  loon 
both  from  tho  closed  and  from  the  open  stacks.  Internal  lending  between  the  branches  of  a library  os  well  ns  borrowing 
and  returning  books  at  a branch  of  another  library  is  possible. 


The  system  operates  completely  with  numerical  informotion  on  users,  books,  and  participating  libraries. 

For  financiol  reasons  vre  started  with  an  off-line  version.  The  processing  was  carried  out  by  eoch  respective  regional  com- 
puter center  (currently  seven  libraries  are  porticipating) . From  the  beginning  the  files  have  been  stored  on  discs.  By  now 
this  off-lirre  version  Ikis  been  replaced  in  some  libraries  by  the  installation  of  minicomputers  of  the  system  Nixdorf  720  far 
the  input  of  alphanumeric  data  (user  and  book  informotion)  and  a system  Nixdorf  710  for  the  recordirtg  of  numerical  date 
(loon  and  return  transactions).  The  system  Nixdorf  710  offers  12  K of  its  main  storage  for  data  storage,  the  system 
Nixdorf  710  8 K.  This  modification  - employed  in  Hessen  initially  by  the  Gesamthochschulbibliothek  Kassel  - mtdces  use 
of  job  sharing  between  the  tttain  computer  and  the  minicomputer  in  the  library  as  fallows: 

The  user  data  and  bo^  data  are  managed  separately  in  the  magnetic  disc  stores  of  the  computer  center 
after  they  hove  been  recorded  on  magnetic  tape  in  the  library.  The  book  information  file  covers  the  complete  circulating 
holdings  of  the  library.  The  record  hos  a fixed  length  of  60  characters  and  consists  of:  1)  library  identification  code, 

2)  book  number,  3)  book  identification,  4)  user  number  (this  field  is  only  occupied  when  the  book  is  loaned  out),  5)  dote, 
6)  statistics,  7)  subject  code. 

The  user  file  has  a fixed  length  of  92  bits,  containing 
the  library  code  the  number  of  loans 

the  user  code  statistical  data 

the  users'  persorrol  data 

The  data  of  10  000  users  take  up  about  4.5  % of  the  capacity  of  a disk  pock.  Tho  indexing  system  (ISFM) 
stores  the  records  ond  the  processing  is  dorte  by  usirrg  index  sequentiell  access  method  (ISAM). 

However,  reserving  of  books,  reminding  and  identifying  of  blocked  users  is  done  with  the  minicomputer  in 
the  library.  As  the  system's  connection  to  the  central  computer  is  on-line,  immediate  infomrations  on  user  and  book  data 
ate  available  during  opening  hours.  All  botch  processing  is  also  done  on  the  tiKiin  cotrtputer. 

This  way  of  using  the  computer  is  fairly  optimal  because  the  processing  of  tho  mass  of  routine  data  (posting) 
advantage  is  taken  of  the  central  computer's  capacity  and  its  peripheral  equipment,  whereas  for  all  odditional  todcs  the 
direct  access  to  the  minicomputer  and  its  possibility  of  immediate  processing  are  mode  use  of. 


5.3. 


TtiU  Is  Iha  way  lha  lytiom  woiki: 

Tb*  ItoiM  of  books  or*  IdontlfM  by  moons  of  bor-oodod  lobols.  A light  pon  scons  tho  bar  code  and  recognires  the 
number.  A check  digit  facility  built  into  the  label,  ensures  the  accuracy  of  the  data.  Users  are  also  identified  by 
unique  bar-coded  labels  offing  to  user  cords. 

Durirqi  check  out,  for  instance,  the  operator  passes  the  light  pen  over  the  bar-coded  label  of  the  user 
cord  ond  the  book.  This  automatically  records  the  transoetion.  The  recording  of  a dischorge  is  accomplished  simply 
by  passing  the  light  pen  ocrou  the  label  of  the  book.  If  there  is  a hold  on  a book  or  if  it  is  overdue  the  system  informs 
the  operator  by  givir^  on  audio-visual  signal . Just  as  reserved  books  con  be  Indlooted  on  discharge,  users  whose  nisnbers 
have  been  stored  can  be  intercepted  at  the  issue  point.  All  transactions  which  take  place  at  the  terminals  are  written 
on  the  magnetic  tope  cassette.  It  is  then  necessary  to  feed  this  data  to  the  computer  for  the  updoting  of  the  user  file 
ond  book  file.  After  that  the  records  for  reserved  books,  reminders  and  blocked  users  ore  stor^  otrd  available  in  the  mini' 
computer.  This  way  of  writirqi  the  dote  on  magnetic  tape  cossettes  and  then  updating  them  on  the  main  computer  is  to  be 
discarded  in  the  near  future  and  to  be  replaced  by  imn^iote  input  and  erasure  of  the  holds,  reminders  and  blockings. 

5.4. 

We  intend  to  work  with  that  type  of  minicomputer  that  provides  sufficient  storage  capocity  for  additional 
softwore  and  data  by  disc  store.  It  is  sofo  to  assume  that  for  o library  with  obout  400  000  loons  per  year  a didc  capacity 
of  mart.  2 Mio.  Bytm  for  the  todcs  of  reserving  books,  reminding  and  blocking  will  be  sufficient. 


6.  CXJNaUSION 


6.1. 

Experience  has  shown  that  the  mechanization  of  book  circulation  on  minicomputers  can  led  to  cheaper 
costs  and  more  effoctivity.  The  smaller  range  of  the  minicomputer's  programs  and  functioning,  easier  to  control  opens 
up  a new  possibility  for  us  librarians:  otMe  again  we  ourselves  can  solve  the  greoter  port  of  our  problerRS  and  take  into 
consideration  the  local  needs  of  libraries. 

It  hos  also  become  evident,  thot  a stand  alone  system  with  fixed  structures  which  is  not  connected  to  a 
large  computer  is  soon  put  urxler  too  great  strain  because  of  the  permanently  growing  demands  on  it  orfo  the  necessity  to 
include  other  tasks  of  the  library  into  the  process  of  automation. 


6.2. 

It  seems  to  me,  that  the  development  of  powerful  flexible  and  cost-effective  minicomputers  on  one  hand 
and  the  embracit^  of  libraries  to  networks  with  a bibliographic  data  base  on  the  other  hand  is  a good  and  hopeful  evolution. 
The  oormection  of  minicomputers  to  o large  computer  within  a network  solves  specific  problems  of  one  library  os  well  os 
gives  the  possibility  of  dtaring  bibliographical  ^ta  orrd  library  resources. 

The  combination  gives  libraries  the  chance  to  integrate  functions  within  a library  and  also  build  up  a 
library  network  system.  Such  a concept  would  combine  the  best  advantages  of  both. 
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Summary 

It  is  wise  to  look  continually  for  possibilities 
to  reduce  the  mental  part  in  data  recording  in 
order  to  achieve  more  efficiency.  The  present 
state  of  minicomputer  development  offers  suit- 
able facilities  for  improving  data  recording  for 
information  retrieval  purposes  and  for  printed 
information. 

An  approach  on  that  line  is  being  undertaken  in 
ZMD.  The  hardware  configuration,  the  developed 
program  packages  and  also  the  experience  gained 
will  be  described. 

The  creation  of  data  needed  for  information  retrieval  and  computerized  production  of 
printed  information  is  a very  sensitive  task  because  of  the  extended  font  of  characters 
and  varieties  of  structures.  All  data  recording  machines  have  a limited  font  of  charac- 
ters. At  first  glance  it  would  seem  impossible  to  process  data  with  an  extended  font  of 
characters.  Therefore  it  was  necessary  to  find  methods  to  manage  more  comprehensive  fonts. 

One  of  the  possible  solutions,  developed  by  ZMD  in  1968,  is  the  creation  of  the  so-called 
protype  method.  A protype  stands  in  place  of  the  original  character  which  cannot  be  other- 
wise represented.  Protypes  are  composed  in  the  following  form: 

a basic  character  or  basic  figure  and 
a two-digit  number. 

Protypes  have  to  be  marked  in  the  text  by  a preceding  control  character,  e.g.  S. 

Examples : 

Protypes  representing  special  characters 
e.g.  a 


$a22 

t p control  character 

p basic  character 

p two-digit  number 

Protypes  representing  special  signs 
e.g.  % 


S36H 

L » control  character 

> basic  figure 

> two-digit  number 

This  protype  method  allows  the  representation  of  about  6 000  different  character^ 


The  original  data  does  not  contain  an  adequate  structure  for  processing  by  computer. 
Therefore,  the  data  must  be  structured  in  a unified  computer  acceptable  format.  That  nor- 
mally can  be  done  by  categorizing  and  by  using  control  characters.  In  the  data  processing 
stage  the  computer  can  recognize  the  data  elements  and  the  contents  of  the  data  from  the 
structure.  The  real  problem  is  to  get  the  structured  data  free  of  errors  in  order  to 
achieve  a high  degree  of  precision  in  the  stage  of  data  processing.  Therefore  it  is  very 
important  to  cheek  and  correct  the  data  during  the  recording  stage  and  before  data  pro- 
cessing by  computer  begins. 

ZMD  uses  paper  tape  typewriters  for  data  recording.  The  paper  tape  stored  input  data  then 
has  to  be  converted  onto  a magnetic  tape  as  this  is  more  suitable  for  cheeking  and  cor- 
recting by  program.  This  program  includes  checking  procedures  of  general  nature  as  well 
as  subroutines  for  special  checking  purposes  in  special  applications.  The  program  works 
by  getting  its  orders  from  parameter  cards  which  are  permanently  stored  in  the  system. 
These  parameter  cards  allow  the  program  to  meet  the  requirements  of  different  applica- 
tions and  projects.  These  parameter  cards  have  a specific  format  in  which  each  card  des- 
cribes the  error  checking  to  be  performed  on  one  category. 

Error  checking  is  conducted  on  such  points  as 

completeness  of  the  categorles-scheme 

- occurence  of  Certain  letters,  numbers  or  special  characters  in  certain 
prescribed  positions 

- prescribed  number  of  control  characters 

- interdependence  of  categories 

- length  of  a record , category , line  or  string 

- ascending  identification  numbers  or  ascending  tags. 

All  incorrect  data  including  a corresponding  error  message  are  listed  by  the  program. 


a 


The  detected  errors  are  corrected  by 

- replacement 

- Insertion 

- deletion 

of  records,  lines  or  words. 

The  correction  commands  have  also  to  be  punched  in  paper  tape  and  in  the  same  way  as  the 
original  data  they  have  corrected  onto  the  magnetic  tape. 

This  processing  cycle  of  data  recording,  converting,  error  checking  and  correction  has 
been  successfully  performed  for  several  years;  nevertheless  it  does  include  essential 
disadvantages : 


I 

1; 


ii 


- there  are  too  many  working  steps  at  too  many  places  performed  by  too  many 
machines  and  persons.  That  means:  time  wasting  and  susceptibility  to  er- 
rors 

- there  are  no  facilities  for  automatic  error  checking  during  recording 

- there  are  insufficient  facilities  for  direct  correction  if  errors  are 
detected  during  data  recording 

- the  workload  is  too  heavy  for  the  ZMD  computer  system  (IBM  370/lJ»5  as 
tyme-shaving  system,  which  runs  24  hours  per  day  and  is  dedicated  to  in- 
formation retrieval  and  text  composition  activities). 

Therefore  ZMD  was  looking  for  possibilities  to  avoid  these  disadvantages  in  the  data  re- 
cording stage  in  order  to  achieve  more  efficiency.  The  present  state  of  minicomputer  de- 
velopment seems  to  offer  such  improvements.  Especially  since  minicomputer  systems  are 
equiped  with  sufficient  disk  capacity  and  a powerful  central  processing  unit,  which  makes 
the  concept  of  a project-specific  checking  program  controlled  by  parameters  a reality  on 


* 


a minicomputer.  This  means  the  transformation  of  the  mentioned  card  images  into  a gene- 
rator program  which  specifies  the  error  checking.  In  other  words  the  minicomputer  can 
execute  the  same  error  checking  and  correction  functions  as  the  bigger  IBM  system  of  ZMD. 

In  order  to  do  this  a minicomputer  used  for  data  recording  purposes  has  to  be  equiped 
with: 

- disk  or  floppy-disk  units  with  sufficient  capacity 

- one  magnetic  tape  unit  compatible  with  the  IBM  computer  to  transfer  the 
disk  stored  data 

- visual  display  unit  (VDU)  with 

. sufficient  capacity  for  representing  a complete  document  and 
. a suitable  font  of  ch{u*acters,  which  includes  characters  in  upper 
and  lower  case  (The  not  directly  available  special  characters  and 
special  signs  could  be  represented  by  protypes) 

- printing  and  hard-copy  facilities. 

ZMD  has  rented  two  minicomputer  systems,  DATAPOINT  2.200  (OIER  Company)  and  CTM  70  (Com- 
puter Technik  MUller  Company),  which  fulfill  these  demands. 

The  special  programs  for  recording,  error  checking  and  correction  of  data  have  been  de- 
veloped by  ZMD.  The  programs  for  DATAPOINT  2.200  are  written  in  the  problem-oriented  lan- 
guage DATABUS,  the  programs  for  CTM  70  are  written  in  ASSEMBLER.  The  system  are  opera- 
tional, but  we  still  need  to  find  the  final  balance  between  the  user  demands  and  the 
technical  aspects.  In  particular  we  have  to  reduce  the  response  time  of  the  DATAPOINT 
system. 

At  this  stage  the  following  advantages  of  data  recording  by  minicomputer  are  clearly  re- 
cognizable: 

- more  simple  procedures 

Data  recording  and  immediate  correction  of  indicated  errors;  data  recor- 
ding, error  checking  and  correction  without  delay  and  without  any  unne- 
cessary intermediate  steps.  All  in  all  this  makes  the  cycle  simple  and 
effective. 

- reduction  of  the  main  computer  workload 

The  data  leaving  the  minicomputer  is  free  of  errors  thus  avoiding  any 
further  checking  and  correction  steps  on  the  main  computer.  This  reduc- 
tion allows  a more  effective  use  of  the  IBM  system  for  further  informa- 
tion retrieval  and  text  composition  activities. 

- no  data  conversion 

Data  conversion  can  be  omitted. 

- more  data  security 

More  data  security  will  be  obtained  because  of  less  data  transportation, 
permanent  availability 

Contrary  to  the  IBM  system  the  minicomputer  as  stand-alone  system  is 
available  permanently. 

- no  special  requirements 

No  special  requirements  of  minicomputer  location  have  to  be  considered. 

In  conclusion,  experience  has  shown  that  a minicomputer  in  connection  with  a visual  dis- 
play unit  offers  considerable  advantages. 

First,  the  program  controlled  data  recording  with  its  extensive  error  checking  procedures 
and  prompting  functions  leads  to  data  free  of  structural  errors. 

Second,  errors  in  contents  of  data  can  also  be  corrected  subsequently  just  as  in  conver- 
sational mode. 


The  minicomputer  is  able  to  master  aueceaafully  recording,  error  checking  and  correction 
of  non-numerlcal  data  independent  of  a large  computer  system.  This  makes  the  minicomputer 
very  interesting  for  libraries  and  documentation  centers. 
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SuBfflary 


The  paper  briefly  reviews  the  evolution  of  international  data  conaunlcations, 
and  then  considers  future  possibilities  for  development  in  more  detail.  Some 
of  the  issues  are  Illustrated  with  reference  to  the  European  Informatics 
Network,  and  the  Euronet  Project,  and  the  paper  concludes  with  a discussion  of 
problems  at  the  user  level. 

Introduction 


Since  the  advent  of  Teleprocessing  some  fifteen  years  ago,  its  impact  on 
society  has  grown  remarkably  - some  might  even  say  alarmingly  - in  spite  of  a 
number  of  fundamental  inhibiting  factors.  These  include  the  generally 
unsuitable  architecture  both  of  hardware  and  of  software  in  large  computing 
systems,  and  the  inadequate  facilities  that  are  provided  by  the  world's 
telecommunications  services.  This  paper  concentrates  on  current  approaches  to 
improving  telecommunication  services  for  the  purpose  of  data-communications, 
assuming  that  the  present  demand  for  these  services  reflects  the  basic  need  of 
people  to  communicate  and  to  obtain  information:  a trend  that  has  been 

apparent  since  the  dawn  of  structured  society. 

In  all  human  affairs,  the  immediate  future  is  largely  constrained  by  the 
present,  and  existing  ideas  and  facilities  are  clung  to  until  pressures  for 
change  become  irresistible.  Teleprocessing  began,  therefore,  with  the 
adaptation  of  the  telephone  network  to  carry  data  between  distributed 
components  of  computing  systems.  But  the  amelioration  of  the  fundamental 
mismatch  between  the  Internal  data  rates  of  computers,  the  external  data  rates 
of  people,  and  the  transmission  rates  of  the  standard  telephone  channel  has 
required  a variety  of  developments  based  on  the  distribution  of  logic  and 
storage  outside  the  telephone  network.  This  approach  to  teleprocessing  system 
design  still  continues. 

New  technology  is,  however,  rapidly  transforming  the  telephone  network  and 
digital  techniques  are  being  introduced  on  an  extensive  scale.  So,  another 
possible  way  to  design  new  data  networks  is  to  exploit  these  new  techniques, 
which  should  provide  better  facilities  for  data  transmission  and  switching  at 
marginal  cost.  On  a closer  examination  this  approach  proves  to  be  less 
attractive,  at  least  in  the  context  of  the  data  users  requirements. 
Nevertheless,  it  is  still  being  pursued  by  some  organisations,  and  this  has  led 
to  proposals  for  a new  digital  data  network  using  synchronous  transmission  and 
switching,  and  even  for  a future  integrated  telephone  and  data  network,  based 
extensively  on  digital  techniques. 

A more  radical  approach  is  to  consider  the  provision  of  communications 
facilities  between  people  and  computer  systems  as  an  overall  systems  problem. 
This  permits  the  adoption  of  solutions  based  on  new  principles  that  are  able  to 
provide  a data  network  with  properties  best  suited  to  user's  requirements. 
Packet  switching  has  been  the  result.  Private  data  networks  use  this  technique 
extensively,  and  most  of  the  current  activity  in  public  data  networks  la  in 
this  area. 

Packet  switching  appeals  to  computer  designers  because  the  features  it  offers 
meet  their  system  requirements,  while  they  can  readily  appreciate  the  principle 
of  dynamic  resource  sharing  on  which  it  is  based.  Hitherto,  communications 
engineers  have  seemed  less  convinced,  but  the  growing  number  of  proposals  for 
national  public  packet  networks  suggests  this  attitude  is  changing.  It  has, 
however,  required  the  construction  of  a number  of  experimental  packet  networks 
in  various  countries  to  achieve  this  evolution  of  opinion,  and  the  differences 
in  design  of  these  networks  seemed  to  make  prospects  for  a world-wide  public 
network  remote,  at  least,  Ontil  quite  recently.  These  issues  are  considered 
later,  using  the  European  Informatics  Network,  and  the  Euronet  Project  as 
examples. 


The  possibility  of  United  facilities  becoaing  available  for  International 
coBBunloatlon  using  packet  switching  now  appears  quite  promising,  but  even  if 
universal  data  connunlcations  facilities  were  already  freely  available,  there 
would  still  be  a major  hindrance  to  the  free  interaction  of  data  processing 
systems;  namely,  their  mutual  incompatibilities.  This  emphasises  the  pressing 
need  for  standards  at  the  user  level,  and  the  paper  concludes  with  some 
discussion  of  this  topic. 

Using  The  Existing  Telephone  Network 


Apart  from  a few  organisations  like  railways  that  have  their  own  communication 
lines,  it  has  been  generally  necessary  to  use  the  telephone  network  for 
carrying  data  between  components  of  any  distributed  computing  system.  This 
creates  some  considerable  difficulties  for  the  users  and  also  for  the  telephone 
network,  but,  nevertheless,  this  approach  has  had  to  be  the  basis  of  all  data 
communications  schemes  in  use  today.  Fundamentally,  it  has  required  the  use  of 
modems  to  convert  digital  signals  to  analogue  form  for  passage  through  the 
telephone  network,  and  has  also  required  the  use  of  storage  and  logic  outside 
the  network  to  overcome  some  of  it  disadvantages.  This  approach  is  still  being 
refined  and  is  likely  to  continue  for  some  time  to  come. 

The  advent  of  Large  Scale  Integration  (LSI),  microprocessors,  and  the  present 
wide  assortment  of  memory  devices  has  enabled  better  use  to  be  made  of  the 
telephone  channel,  by  designing  terminals  with  enhanced  capabilities  compared 
with  earlier  versions,  and  these  developments  have  more  or  leas  kept  pace  with 
users  requirements.  But  although  world  wide  communication  is  potentially 
available  through  the  telephone  network,  fundamental  disadvantages  still  remain 
that  make  this  approach  unacceptable  as  a long  term  way  of  providing  data 
services.  However,  it  is  worthwhile  noting  that  the  flexibility  provided  by 
the  use  of  modern  techniques,  makes  it  potentially  easy  to  adapt  terminals 
being  developed  now  to  suit  the  public  dat«  networks  that  will  be  coming  into 
service  in  the  years  ahead. 

This  adaptability  may  not  apply  in  the  case  of  some  other  contemporary  network 
building  equipment.  For  example,  there  now  seems  to  be  a rapid  growth  in  the 
marketing  of  concentrators,  and  other  equipment  incorporating  storage  and 
intelligence,  intended  for  use  in  new  private  networks.  It  seems  likely  that 
the  commercial  life  of  such  equipment  will  be  relatively  short,  because  the 
proposed  public  networks  promise  to  provide  similar  features  on  a more 
extensive  and  favourable  basis,  and  there  are  bound  to  be  strong  pressures  by 
the  Authorised  Telecommunications  Carriers  to  limit  the  licensing  of  future 
private  networks. 


Using  The  Future  Telephone  Network 


The  telephone  network  itself  is  rapidly  being  transformed  by  the  introduction 
of  techniques  based  on  the  use  of  large  scale  integrated  circuits  and  this 
process  will  accelerate.  Digital  transmission  has  been  used  for  a long  time, 
and  PCH  (pulse  code  modulation)  shorthaul  links  using  time  division 
multiplexing  (TDM)  techniques  are  commonplace  in  many  countries.  At  first,  all 
switching  had  to  be  done  at  the  analogue  signal  level  and  analogue  to  digital 
and  digital  to  analogue  conversions  had  to  be  made  at  each  exchange. 

Relatively  recently,  digital  switching  has  been  introduced  where  channels  are 
switched  through  tandem  exchanges  unchanged,  using  digital  logic  and  temporary 
storage  to  delay  PCM  time  slots  in  different  links  to  bring  them  into  alignment 
with  each  other. 

Looking  ahead  the  advent  of  analogue  to  digital  convertors,  using  LSI,  suitable 
for  mounting  in  telephone  instruments  would  bring  the  digital  network  right  to 
the  subscribers  premises.  This  is  not  economical  at  the  present  time,  but 
could  be  so  in  the  next  few  years.  But  it  is  very  unlikely  to  be  available  on 
a really  world  wide  scale  in  the  foreseeable  future,  and  this  is  a strong 
inhibiting  factor  in  the  adaptation  of  this  and  similar  techniques  as  a basis 
for  new  data  networks. 

A further  example  of  the  use  of  LSI  are  the  developments  now  occurring  in 
signalling;  it  is  now  possible  to  use  push  button  dialling  with  the  Strowger 
step-by-step  switching  system,  by  means  of  integrated  circuits  which  store 
numbers  as  they  are  put  in,  and  then  translate  them  into  a sequence  of  pulses, 
similar  to  those  produced  by  the  normal  dial.  However,  this  kind  of  adaptation 
does  not  necessarily  confer  the  uniformity  necessary  for  other  uses  of  the 
telephone  networx,  and  it  may  prove  difficult  to  employ  such  a scheme  in 
conjunction  with  the  voice  response  computer  based  interrogation  systems, 
designed  for  use  with  the  conventional  type  of  rotary  dial,  or  the  push  button 
dial  with  multi-frequency  signalling.  (Ref  1). 


In  sumaary,  the  exploitation  of  the  digital  aspecta  of  the  telephone  network 
for  data  doea,  Initially,  aeen  attractive  and  likely  to  provide  marginal  coata 
for  data  traffic.  But  the  long  delay  before  theae  featurea  become  unlveraal 
and  the  fact  that  they  do  not  neceaaarlly  confer  dealrable  propertied  for  the 
data  uaer  In  a network  primarily  Intended  to  carry  apeech  traffic,  makea  their 
Impact  on  future  data  communlcatlona  rather  leaa  likely  than  at  flrat  alght. 

An  Integrated  Telephone  and  Data  Network 


The  Idea  of  a unlveraal  network  that  would  offer  all  klnda  of  aervlcea  to  uaera 
In  a coherent  faahlon  haa  been  dlacuaaed  for  many  yeara.  The  problem  with  thla 
approach  la  to  predict  uaera'  future  requlrementa,  taking  Into  account  the 
heavy  Inveatment  and  the  long  time  acale  necesaary  to  create  auch  a network. 
However,  there  have  been  varloua  propoaala  for  an  all-dlgltal  data  network  that 
alght,  conceivably,  be  compatible  with  a future  digital  telephone  network,  ao 
that  tranamlaaion  plant  and,  hopefully,  aome  awitchlng  would  be  common  to  both 
networka. 

A number  of  propoaala  have  been  dlacuaaed  within  CCITT  (International  Telegraph 
and  Telephone  Conaultative  Committee)  for  digital  public  networka  and  a 
particular  acheme  haa  been  conaldered  quite  aerloualy  for  Scandinavia  (Ref  2). 
Unfortunately,  there  la  much  preaaure  to  dealgn  new  digital  data  networka  to  be 
almllar  to  the  expected  form  of  the  telephone  network,  even  if  thla  makea  the 
data  network  leaa  aultable  for  ita  Intended  purpoae.  An  example  la  the 
controveray  that  haa  raged  over  the  number  of  bita  to  be  included  in  the  time 
alot  for  each  channel. 

To  accomodate  the  eight  bit  byte  common  In  computer  ayatema,  a propoaal  waa  put 
forward  for  a ten  bit  time  alot,  with  eight  bita  for  data,  one  bit  for  framing, 
and  one  bit  for  atatua,  ie.  to  dlatinguiah  data  from  aignalling  Information 
which  could  be  interleaved  with  the  data.  Thla  la  the  ao  called  eight  plua  two 
bit  acheme.  Unfortunately,  the  TDH  telephone  channel  In  the  USA  la  baaed  on 
the  uae  of  eight  bita  in  a time  alot  with  aeven  bita  for  digitally  encoded 
apeech,  and  one  bit  for  framing:  thla  fact  haa  given  riae  to  the  rival  aix 

plua  two  bit  propoaal  ir>  which  data  Is  carried  In  aix  bit  groupa,  with  framing 
and  signalling  bits  added  to  make  an  eight  bit  time  slot.  (Ref  3) 

A future  digital  network  based  on  a ten  bit  channel  could  have  readily 
accommodated  a seven  bit  speech  channel  with  the  extra  advantage  of  providing 
channel  associated  signalling.  Instead  of  carrying  all  signalling  in  a common 
channel  as  is  the  case  with  PCH  telephony  today.  However,  the  arguments  put 
forward  by  telephony  system  designers,  based  on  the  heavy  investment  already 
made  In  old  equipment,  seem  to  have  moved  the  choice  in  the  direction  of  the 
eight  bit  channel,  to  the  detriment  of  a possible  future  Integrated  data  and 
telephone  network,  which  would  really  meet  the  needs  of  data  users. 

It  now  seems  that  the  plans  of  some  carriers  for  the  provision  of  a digital 
synchronous  data  network  by  the  early  1980's  will  be  delayed  to  the  1990's. 

This  may  be  due  to  recent  announcements  about  public  packet  switching  networks 
which,  if  successful,  may  cause  further  delay  or.  Indeed,  eventually  lead  to 
the  cancellation  of  such  plans.  This  means  that  an  Integrated  network  may 
never  be  possible  on  a world  scale  and  that  telephone  and  data  networks  will 
evolve  Independently,  hopefully  to  meet  the  requirements  of  their  users. 

Packet  Switching  Networka 


Since  It  was  conceived  in  the  early  sixties  (Refs  4,5),  the  technique  of  packet 
switching  for  data  communications  has  developed  rapidly;  this  has  given  rise  to 
experimental  networks  in  various  parts  of  the  world,  and  we  are  now  at  the  dawn 
of  an  era  when  an  Increasing  number  of  public  packet  switching  networks  will  be 
Introduced  by  various  communications  carriers. 

Until  quite  recently,  many  of  the  ideas  under  diacussion  In  connection  with  the 
experimental  networks  seemed  to  be  Incompatible,  and  the  prospect  for  the  user 
seemed  bleak.  But,  following  a quite  unprecedented  degree  of  cooperation 
between  various  network  authorities,  there  has  been  a convergence  between  the 
different  ideas,  with  the  result  that  two  basic  kinds  of  service  are  now  seen 
as  possibilities  for  public  networks  of  the  future.  Existing  experimental 
networks  seem  likely,  in  many  cases,  to  adapt  themselves  to  accommodate  one  or 
other  of  these  basic  services,  which  have  become  known  as  the  datagram  service, 
and  the  virtual  circuit  or  call  service. 

An  example  of  the  datagram  type  of  network  is  the  EIN  (European  Informatics 
Network)  (Ref  6),  while  the  virtual  call  network  is  typified  by  the  Euronet 
network  (Ref  7)  which  will  use  switching  centres  identical  in  design  with  those 
of  EIN,  but  with  additional  front  end  processors  to  implement  a virtual  call 
service.  The  use  of  a datagram  network  as  a basis  to  carry  a virtual  call 
service  is  also  apparent  in  other  networks  such  as  Telenet  in  the  USA  (Ref  8), 
and  Datapac  in  Canada.  The  similarity  in  architecture  between  these  various 
networks  could  provide  various  modes  of  Interconnection  and  this  may  prove  to 


be  very  Inportant  for  international  data  coaaunioations  in  the  future.  Jt  is 
worthwhile  looking  in  wore  detail  at  these  two  kinds  of  service,  since  it  is 
likely  that  users  on  a worldwide  basis  will  be  greatly  affected  by  the  outcome 
of  discussions  that  are  now  taking  place. 

The  datagram  service  is  basically  simpler  than  that  of  the  virtual  call. 

Packets  are  accepted  from  users  and  handled  independently  of  each  other,  so 
that  all  information  for  guiding  a packet  to  its  destination  must  be  contained 
in  a header  accompanying  each  packet.  In  contrarit,  the  virtual  call  service, 
which  also  handles  packets,  has  a preliminary  call  establishment  phase  whereby 
information  la  stored  about  an  anticipated  sequence  of  packets  from  a 
subscriber.  Thereafter,  each  packet  need  have  only  an  abbreviated  address, 
because  the  users  local  switch  adds  whatever  is  necessary  to  pass  it  through 
the  network. 

From  the  users  view  point,  there  may  seem  to  be  little  to  choose  between  these 
services,  but  different  methods  of  implementation  within  each  national  network 
may  yield  quite  different  performance  characteristics.  This  is  a subject  that 
is  now  being  hotly  debated  and  on  which  further  experimental  evidence  is 
required . 

A further  topic  of  debate  is  whether  the  virtual  call  should  be  implemented 
only  in  the  switches,  to  which  users  are  connected,  or  whether  buffers  and 
other  resources  should  be  assigned  to  every  user's  virtual  call  at  each  of  any 
intermediate  switches.  The  latter  scheme  implies  that  the  route  taken  by  a 
call  is  fixed  when  it  is  set  up,  and  means  that  flow  control  can  be  carried  out 
easily,  so  that  congestion  is  much  easier  to  prevent.  On  the  other  hand,  there 
are  advantages  in  dynamic,  rather  than  fixed,  allocation  of  resources  at 
intermediate  switches,  t6  allow  easy  alternate  routing  within  a call,  and  to 
make  better  use  of  network  capacity,  especially  when  the  traffic  flow  is  very 
irregular.  These  topics  should  be  the  subject  of  simulation  and  experiment 
before  vital  decisions  are  taken  that  could  affect  users  for  years  ahead. 

The  Use  of  Satellites 


The  packet-switching  networks  being  built  or  planned  by  various  countries  will 
provide  services  more  appropriate  to  data  communications  than  those  presently 
provided  by  the  telephone  network,  and  will  also  use  transmission  resources 
more  effectively  - a factor  particularly  important  with  long-distance 
international  links.  Unfortunately,  as  the  number  of  nodes  in  a network 
Increases,  considerable  queueing  delays  can  arise  when  sevei al  nodes  are  in 
tandem. 

The  introduction  of  satellite  stations  into  a data  communications  network  could 
enable  direct  connections  to  be  set  up  between  the  extremities  so  that, 
although  a satellite  channel  has  a transmission  delay  of  a quarter-second,  the 
consequence  for  the  integrated  satellite/terrestrial  system  may  be  a faster- 
response  network.  Furthermore,  the  multiple-access  capability  of  the 
communications  satellite  would  enable  any  number  of  links  to  be  established 
without  decreasing  the  transmission  efficiency  of  the  satellite  repeater.  This 
would  make  it  possible  to  use  earth  stations  of  modest  size,  which  are  cheap  to 
build  and  easy  to  locate,  to  set  up  a satellite  network  of  considerable 
capacity. 

Much  experimental  work  has  been  done  in  the  USA  (Ref  9)  on  the  use  of  satellite 
stations  operated  in  various  novel  modes,  that  are  well  suited  for  use  in 
packet  switching.  But  it  is  not  clear  whether  a public  network  would  use  such 
techniques.  The  policy  with  satellite  channels  in  telephone  usage  has  been  to 
treat  them  as  equivalent  to  a terrestrial  link,  so  that  a call  may  be  switched 
through  either,  without  the  user  being  aware  of  which  type  of  channel  he  is 
using.  This  has  constrained  the  design  of  satellite  systems  to  accord  with  the 
needs  of  the  telephone  network,  and  it  remains  to  be  seen  how  this  will 
influence  their  use  for  data  communications. 

Recently  there  has  been  a proposal  to  associate  satellite  earth  stations  with 
the  nodes  of  EIN  to  allow  them  to  exchange  data  via  the  experimental  Orbital 
Test  Satellite  due  to  be  launched  for  the  European  Space  Agency  in  mid  1977. 

The  earth  stations  would  be  small,  unattended  installations,  with  a small 
parabolic  refector  antenna  of  not  more  than  three  metres  diameter,  an 
inexpensive  receiving  system  and  a low-power  transmitter.  Signals  would  be 
transmitted  at  high  speed,  using  time-division  multiple-access  (TDHA),  where  by 
each  earth  station  sends  short  bursts  of  signals,  which  arrive  at  the  satellite 
one  after  the  other,  so  that  the  repeater  transmits  one  signal  at  a time,  with 
consequent  Improved  efficiency  and  reduced  distortion. 

However,  it  could  be  many  years  before  traffic  satellites  assigned  exclusively 
to  data,  with  techniques  designed  to  employ  them  efficiently,  will  become 
available. 


User  System  Aspects 


Unfortunately,  the  provision  of  common  data  connection  services  is  only  a small 
part  of  the  solution  to  the  problem  of  easy  interaction  between  users.  Here  an 
analogy  can  be  drawn  between  communication  among  computers  and  communication 
among  people.  All  people  have  the  same  mechanism  for  speech  and  hearing,  but 
use  a variety  of  different  natural  languages  which  prevent  them  from 
intercommunicating.  The  new  packet  networks  can  give  a universal  transport 
mechanism  for  data,  but  the  very  considerable  differences  between  system 
languages  and  data  structures  used  for  computers  of  different  manufacture  make 
them  largely  Incompatible. 

The  Inherent  incompatibility  between  computer  systems  has  always  been  a serious 
problem  for  users,  but  the  advent  of  data  networks  has  made  it  much  more 
worthwhile  to  find  a solution.  The  current  approach  is  to  define  a set  of 
layered  protocols  which  offer  standard  methods  of  interfacing  at  a variety  of 
levels;  it  then  remains  for  each  of  the  sy;stema  to  be  adapted  to  communicate 
using  these  protocols  in  addition  to,  or  possibly  Instead  of,  their  existing 
methods.  As  an  example,  the  user-oriented  protocols  being  developed  for  the 
European  Informatics  Network  are  shown  in  Figure  1. 

Protocols  fall  broadly  into  two  categories:  those  which  adapt  the 

communications  facilities  to  have  a common  interface  to  users,  and  those  which 
make  users'  systems  mutually  compatible.  The  first  type  are  shown  at  the 
bottom  of  the  figure.  They  are  as  follows 

THE  LINE  PROTOCOL  (LP)  deals  with  problems  of  error  detection  and 
retransmission  and  detects  line  and  switch  failures,  it  is  responsible  for 
ensuring  packets  arrive  error  free  at  the  level  of  the  TRANSPORT  STATION. 

THE  END  TO  END  PROTOCOL  OR  TRANSPORT  STATION  (TS)  is  essentially  a multiplexer 
and  demultiplexer.  It  takes  the  packet  stream  from  the  LINE  PROTOCOL  and 
examines  addresses  in  order  to  separate  the  stream  into  several  substreams 
which  are  used  by  other  levels  of  protocol.  It  provides  services  such  as  the 
ordering  of  packets  in  a substream  and  may  also  create  calls  between  processes 
representing  higher  level  protocols.  The  Transport  Station  exchanges  'letters' 
and  'telegrams'  with  Transport  Stations  in  remote  Subscriber  Computers,  anl  it 
may  Include  facilities  such  as  an  interrupt  whereby  an  urgent  telegram  may  be 
superimposed  on  a call. 

THE  NETWORK  CONTROL  MODULE  (NCM)  has  the  task  of  interacting  with  the 
communications  facilities  and  with  the  network  control  module  at  a remote  site, 
in  order  to  monitor  the  behaviour  of  these  facilities.  The  NCH  may  also 
contain  a mechanism  for  signalling  alarms  to  operators  at  attached  computers, 
and  for  allowing  them  to  control  a communications  network. 

Once  the  end  to  end  protocol  has  been  implemented  at  a Subscriber  Computer  (SC) 
attached  to  a network,  ports  are  available  for  a connection  or  liaison  with 
ports  in  other  SCs.  The  communication  between  these  ports  can  be  regarded  as 
error  free,  because  problems  of  retransmission,  sequencing  etc.  are  taken  care 
of  in  the  lower  levels  of  protocol.  It  is  therefore  convenient  to  regard  each 
pair  of  ports  as  connected  directly  together  with  Just  a delay  inserted  in  the 
path  between  them.  The  line  protocol,  and  the  transport  station  therefore 
greatly  simplify  the  use  of  a communications  network. 

The  second  type  of  protocol , Intended  to  make  computer  systems  more  able  to 
interact  together  are  shown  in  the  centre  part  of  Figure  1.  They  are  briefly 
described  as  follows 

THE  BULK  TRANSFER  FACILITY  (BTP)  is  Intended  to  control  the  transfer  of  large 
quantities  of  data  between  one  port  and  another  and  to  handle  problems  such  as 
the  different  word  length  in  the  machines  that  are  in  communication.  It  also 
has  to  cater  for  recovery  after  breaks  in  transmission,  so  that  the  bulk 
transfer  operation  does  not  have  to  start  again  from  the  beginning  if  it  is 
accidentally  suspended. 

THE  LINE  ORIENTED  PROTOCOL  (LOP)  is  concerned  with  an  interactive  exchange  of 
Information  between  a pair  of  ports.  It  is  responsible  for  maintaining  calls 
between  ports  and  takes  care  of  problems  of  reconnection  if  breaks  should 
occur,  ensuring  that  the  call  is  restored  between  the  correct  correspondents. 

THE  FILE  TRANSFER  PROTOCOL  (FTP)  makes  use  of  the  BTF  to  tranfer  a structured 
file  of  information  from  one  system  to  another.  The  definition  of  a uniform 
structure  for  files  exchanged  between  different  systems  connected  by  a network 
makes  it  possible  to  map  from  the  particular  file  structures  used  by  each 
system  onto  the  standard  FTP. 
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THE  REMOTE  JOB  ENTRY  PROTOCOL  (RJE)  may  be  based  on  the  standard  FTP.  It 
allows  Jobs  to  be  acconpanled  by  an  appropriate  'Job  control  card*  to  a remote 
site,  and  the  results  of  the  Job  can  be  eventually  received  back,  also  in  the 
fora  of  a standard  foraat  file. 

THE  INTERACTIVE  TERMINAL  PROTOCOL  (ITP)  is  based  on  the  existence  of  a line 
oriented  protocol.  It  allows  the  terminals  wishing  to  communicate  through  a 
network  to  do  so  with  a standard  set  of  commands  and  responses. 

THE  VIRTUAL  TERMINAL  (VT)  is  able  to  perform  an  arbitrary  set  of  terminal-like 
operations,  typical  of  a range  of  real  terminals  that  might  be  connected  to  the 
network.  Handlers  in  each  of  the  centres'  systems  can  be  written  to  manipulate 
the  virtual  terminal  and  this  allows  the  interaction  between  services  in 
Subscriber  Computers,  and  terminals  that  communicate  with  them  through  the 
subnetwork  to  be  made  in  a standard  way.  It  is  then  necessary  to  associate  a 
terminal  handler  with  each  type  of  physical  terminal,  in  order  to  translate  the 
real  terminals*  commands  and  responses  into  those  defined  for  the  virtual 
terminal . 

A NETWORK  CONTROL  LANGUAGE  (NCL)  or  network  command  language  is  highly 
desirable  to  allow  users  to  communicate  with  the  network  services  in  a standard 
manner.  It  is,  however,  difficult  to  define  a universal  language,  and  a number 
of  application  oriented  languages  may  be  the  best  practical  way  to  suit 
different  users. 

SPECIAL  APPLICATIONS  PROTOCOLS  (SAP)  for  more  advanced  forms  of  interaction  may 
be  required  later.  An  example  would  be  a protocol  for  handling  interactive 
graphics  terminals  in  a uniform  manner. 

The  agreement  and  implementation  of  standard  high  level  protocols  similar  to 
those  described  briefly  above  makes  different  systems  look  similar  to  users, 
and  this  approach  could  be  extended  to  still  higher  levels  of  protocol.  For 
example,  there  has  been  a gradual  development  of  techniques  to  allow  data  to  be 
shared  by  a number  of  users,  operating  under  one  single  information  management 
system,  and  it  is  now  becoming  worthwhile  to  develop  distributed  database 
management  systems,  so  that  databases  in  geographically  widespread  locations 
can  be  used  by  a wide  range  of  terminals  and  computer  systems.  The  demand 
created  for  network  based  services,  made  possible  by  developments  of  this  kind, 
may  well  require  considerable  changes  in  the  structure  of  future  large  computer 
systems. 

The  Data  Network  Boundary 


For  historical  reasons,  communications  facilities  are  provided  by  Authorised 
Carriers  (usually  the  PTTs  in  Europe),  while  data  terminals  and  computers  are 
supplied  by  the  computer  Industry.  There  is  no  clear  natural  boundary  in  a 
computer  network  where  communications  give  way  to  computing  and  this  gives  rise 
to  conflict  which  distorts  the  evolution  of  teleprocessing  systems. 

At  first.  Carriers  generally  regarded  data  as  a nuisance,  but  now  are 
increasingly  seeing  the  provision  of  a public  data  network  as  a vital  extension 
of  their  business,  and  are  taking  a major  role  in  the  definition  of  standards. 
The  impact  of  this  on  future  developments  is  not  yet  clear,  but  an  indication 
is  given  by  what  has  happened  with  the  European  Informatics  Network  and  the 
Euronet  Project. 

When  the  European  Informatics  Network  was  defined,  the  Carriers'  discussions  on 
packet  switching  were  mainly  concerned  with  a datagram  service.  EIN  was, 
therefore,  designed  to  give  such  a service,  but  had  some  additional  features 
anticipating  a possible  virtual  circuit  service.  However,  the  main  features  of 
a virtual  circuit  are  provided  by  the  EIN  Transport  Station  protocol  described 
above.  This  protocol  is,  of  course,  implemented  by  software  in  users'  systems. 

By  the  time  Euronet  was  planned,  the  virtual  circuit  definition  had  been 
improved,  and  the  European  PTTs  (who  are  providing  for  Euronet  a communications 
network  to  link 'terminals  to  various  data  bases)  adopted  the  EIN  design,  but 
added  terminal  processors  offering  the  CCITT  X 25  virtual  circuit  Interface 
(Ref  10).  The  first  type  of  protocol  defined  for  EIN  and  described  above,  is 
not,  therefore,  required  by  Euronet  users,  because  X 25  offers  a similar  scheme 
in  the  network  rather  than  in  the  users'  systems.  Thus,  the  boundary  between 
the  communications  facilities  and  the  computing  services  has  moved  towards  the 
user,  during  the  Interval  between  the  design  of  EIN  and  the  design  of  Euronet. 
This  is  illustrated  by  figure  2,  which  should  be  compared  with  figure  1.  As 
the  detailed  design  of  Euronet  continues,  it  seems  likely  that  some  higher 
level  protocols  such  as  the  Virtual  Terminal  may  also  be  implemented  within  the 
terminal  processor.  For  example,  to  handle  character  terminals  a Packet 
Assembly  Disassembly  (PAD)  function  has  been  proposed  to  interface  them  to  the 
packet  network.  This  PAD  can  readily  be  augmented  to  include  simple  VT 
features  suitable  for  a range  of  terminals. 
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The  disadvantage  of  this  trend  is  that  the  opportunity  for  the  user  to  develop 
novel  systeos  nay  be  reduced  if  the  communications  functions  become  stereotyped. 
It  is  therefore  important  to  provide  a means  to  by-pass  the  higher  level 
facilities  offered  by  a network,  and  the  use  of  the  basic  datagram  service  is 
one  method  by  which  this  may  be  done. 

10  Future  Prospects 


The  move  towards  including,  within  a data  network,  facilities  that  initially 

formed  part  of  the  users  computer  system  seems  likely  to  continue.  Until  , 

recently,  the  development  of  big  computer  systems  was  determined  by  the  large  li 

main  frames,  and  the  software  that  their  manufacturers  built  around  them.  The  j! 

early  facilities  for  data  communications  were  therefore  tailored  to  the  needs  .1 

of  such  systems,  within  the  constraints  placed  by  the  use  of  the  telephone  i 

network.  But  the  advent  of  micro  processors  is  allowing  distributed  logic  to 

be  used  increasingly,  and  the  agreement  on  standard  protocols  permits  many  !' 

basic  communications  functions  to  be  realised  more  cheaply  as  common  facilities 

in  a public  data  network,  rather  than  as  part  of  the  users  system.  This  will 

leave  the  users  to  concentrate  on  high  level  functions  that  are  special  to  the 

services  they  use  or  provide. 

The  advent  of  techniques  such  as  Viewdata,  means  that  the  era  of  the  home 
terminal  may  not  be  far  ahead  (Ref  11),  and  it  is  difficult  to  see  how  this  can 
be  provided  and  maintained  other  than  by  the  traditional  carriers.  Hopefully 
this  will  lead  to  intense  competition  within  the  computer  industry  to  provide 
services  that  suit  the  needs  of  the  small  user,  who  should  form  a mass  market 
more  valuable  than  that  of  the  present  large  organisations  that  use  today's 
data  services.  This  will  truly  cause  a revolution  in  the  way  that  future 
teleprocessing  systems  must  develop. 
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Maximising  the  Use  of  an  Information  Service  in  an  International  Environment. 


U.A.  Martin 
European  Space  Agency 
Space  Documentation  Service 

' ESRIN.  Via  Galileo  Galilei 

00044  Frascati.  Rome,  Italy 

■i 

SUMMARV 

The  devefopmeiU  t/ie  ESA  Space  VoamentcuUon  SeA.vic.t  1964  to  date,  li  biueily  ’itvieued.  SOS  databaie 

poticy,  lohich  nuit  iotLiiy  the  needi  oi  both  the  Agency  and  iX6  member  countA^ea,  i6  explained  and 

utilliation  tfiends  ion  all  majoK  databoiei  examined.  Baaed  on  a target  oi  ieti-iuppoat  6ince  1971,  the 

evolution  oi  SPS  choAging  policy  and  the  integaaion  the  recently  introduced  RTC  {remote  terminal  con-  i 

centrator)  are  deioUbed.  The  derivation  oi  databaie  related  coa-ta,  their  potential  ior  reduction,  and  the  J 

uieiul  price  reductloni  uihleh  u)outd  reiult  irom  a ilgnlilcant  Increaie  in  overall  iyitem  load  iactor  are 

outlined.  The  comequencei  and  implicatlom  oi  working  in  an  international  envlroment  are  reviewed  and  I 

the  particular  problemi  data  coimtunlcatloni  in  Europe  are  emphoilied.  The  exponential  growth  in  demand  j 

ior  iniormatlon  iervlcei  oa  indicated  by  recent  projectiom  li  noted,  highlighting  the  need  to  improve  and  i 

a^pitc^^  current  typeo  oi  rervlce;  ongoing  experimental  work  at  SOS  on  an  Integrated  iniormatlon  bate  are  j 

briery  mentioned  Including  tome  thoughts  on  the  multi-lingual  requirement.  j 


1.  INTRODUCTION:  SDS  AND  ESA  RECON 

The  European  Space  Research  Organisation  (ESRO)*  was  established  in  June  1962,  following  the  work  of 
the  European  Preparatory  Connission  for  Space  Research  (COPERS)  on  signature  of  a convention  which 
entered  into  force  in  March  1964.  During  the  COPERS  period  a group  of  experts  had  reconmended  the 
B establishment  of  a comprehensive  European  mechanised  aerospace  information  service  and  the  negotiation 

of  an  information  exchange  agreement  with  NASA.  The  Space  Documentation  Service  (SDS)  was  set  up  in 

1964  and  an  agreement  was  negotiated  with  NASA  in  the  same  year. 

Some  milestones  in  the  development  of  SDS  have  been: 

I 

1965  - pilot  operation  of  computer  retrieval  on  the  NASA  tapes  under  contract; 

I - input  processing:  section  is  created  for  acquisition,  abstracting,  indexing,  and  microfiche 

j production/reproduction  (the  latter  under  external  contract)  of  European  aerospace  literature. 

1969  - the  NASA  File  has  grown  to  over  400,000  bibliographic  items  and  off-line  searches  take  20  hours 
of  computer  residence  time  to  run  a batch  of  30  questions,  the  operation  is  now  prohibitively 
expensive; 

- following  the  development  by  Lockheed  of  the  NASA  RECON  software  under  a NASA  contract,  and  an 
^ evaluation  of  available  packages,  ESRO  also  concludes  a contract  with  Lockheed  (with  NASA 

approval)  for  the  provision  of  the  ESRO  RECON  software; 

ESRO  RECON  is  implemented  on  the  Organisation's  largest  computer,  an  IBM  360/65,  at  the  satel- 
lite tracking  and  control  centre  (ESOC),  Darmstadt,  with  remote  terminals  at  Head  Dffice,  Paris 
and  ESTEC  in  the  Netherlands. 

first  Member  State  national  documentation  centre  instals  direct  on-line  access  to  ESRO  RECON 
from  the  Technology  Reports  Centre,  St.  Mary  Cray,  U.K.; 

first  demonstration  of  long-distance  remote  access  via  RECON  is  sponsored  by  US  AEC  using  an 
ESRO  RECON  terminal  in  Paris  to  interrogate  a computer  at  Lockheed's  Palo  Alto  laboratory  by 
means  of  the  then  new  TAC-5  transatlantic  telephone  cable. 

USAEC  RECON  terminal  at  Washington  ASIS  conference  interrogates  the  ESRO  RECON  database  at 
Darmstadt  via  the  INTELSAT  communicationis  satellite. 

central  database  is  augmented  to  include  COMPENDEX,  METADEX,  GRA  and  Nuclear  Science  Abstracts, 
and  exceeds  one  million  bibliographic  items  on-line; 

SDS  re-locates  to  ESRIN,  Frascati  near  Rome,  and  IBM  360/50  computer  dedicated  to  RECON  and  SOS 
tasks  installed. 

* Uember  Countries:  Belgium,  Denmark,  Trance,  Germany,  Italy,  Netherlands,  Spain,  Sweden,  Switzerland, 
United  Kingdom 
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1973  - 
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1974  - the  first  non-aerospace  database  - CA  Condensates  1$  added. 

1975  - European  Space  Agency  Is  created: 

- for  the  first  time  a leased-llne  terminal  is  Installed  in  the  national  centre  of  a non-Member 
State  at  CNO,  Rabat,  Morocco: 

- 20  remote  high-speed  terminals  now  installed  In  the  Member  States  and  both  TYMNET  and  CYCLADES 
(France)  networks  now  Interconnected  with  ESANET  providing  access  for  the  first  time  to 
Inexpensive,  low-speed  dial-up  terminals:  Installation  of  the  first  SOS  RTC  (remote  terminal 
concentrator)  begins,  this  "first"  again  being  scored  by  the  Technology  Reports  Centre,  St.  Mary 
Cray,  U.K.  INSPEC  database  Is  added: 

- CPU  Is  upgraded  to  IBM  360/65. 

1976  - RTC  at  St.  Mary  Cray  operational  and  the  Installation  of  a remote  medium-speed  line  printer  Is 

agreed,  followed  by  other  RTC  Installations  at  KTH,  Stockholm:  Royal  Library,  Brussels:  0TB, 
Copenhagen:  whilst  discussions  leading  to  the  Interconnection  of  the  CNUCE  (Italy)  and  CTNE 
(Spain)  networks  reach  an  advanced  stage:  exchange  of  letters  between  the  Conmlsslon  of  the 
European  Connunities  and  ESA  for  practical  cooperation  In  establishing  EURONET  Is  negotiated. 
ISI  Science  Citation  Index  and  PASCAL  (Bulletin  Signaletique)  databases  added: 

- 16  IBM  2314  disks  (critical  to  response  time)  replaced  by  6 high-speed  high  density  disks: 

- satellite  communication  now  permits  access  virtually  anywhere  in  the  world,  preparations  made 
for  ESA  RECON  demonstration  In  Bombay  at  the  request  of  UNESCO. 


THE  SOS  CENTRAL  DATABASE 

2.1  SOS  database  policy  has  to  satisfy  two  quite  separate  needs: 

- First  to  ensure  the  availability  on-line  of  those  databases  needed  by  the  Agency*  in  the  sup- 
port of  its  various  projects. 

- Second,  to  satisfy  the  aerospace  and  aerospace-related  Information  needs  of  the  ESA  Member 
States. 

2.2  The  provision  of  computer  searches  of  the  NASA  File  to  the  Member  States  of  ESRO  and  ELOO  was 
originally  the  very  "raison  d'etre"  of  SOS.  Thus  this  database,  to  which  SOS  contributes  input  In 
machine  readable  form,  was  the  first  and  foremost  computer  readable  SOS  resource,  and  it  remains 
so  today, 

2.3  However,  because  no  multi-disciplinary  mission-oriented  Information  system  can  cover  one  category 
within  its  scope  to  the  same  depth  as  a discipline-oriented  service  covering  that  category 
exclusively.  It  is  usually  necessary  to  augment  the  primary  mission-oriented  database  with  a num- 
ber of  discipline-oriented  databases,  each  covering  a part  of  the  main  area  of  Interest. 

For  this  reason  SOS  augmented  the  NASA  File,  as  soon  as  capacity  was  available,  with  the  databases 
listed  at  Table  1 section  A.  Other  available  machine  readable  databases  are  continually  under 
consideration.  In  the  light  of  the  changing  requirements  of  the  Agency.  Most  recently  this  pro- 
cedure has  resulted  in  the  selection  of  the  databases  listed  at  Table  1 section  B,  whilst  Spacelab 
requirements  can  be  expected  to  give  rise  to  further  additions.  The  verification  of  these 
decisions  Is  given  at  Figure  1 which  shows  the  utilisation  trends  In  actual  connect  hours  for 
those  databases  accounting  for  over  85  per  cent  of  Member  States  use  of  ESA  RECON.  It  is  signifi- 
cant that  NASA  recently  adopted  a similar  policy  following  an  Investigation  of  Internal  user  needs 
and  plans  to  add  new  databases  during  1976. 

It  should  be  noted  that  not  all  SOS  databases  are  necessarily  accessible  to  all  users  in  all  Mem- 
ber States.  Access  Is  defined  for  each  Individual  password. 

2.4  Partly  as  a result  of  the  introduction  of  the  policy  of  cost  recovery  for  SOS  and  partly  due  to 
the  overwhelming  demand  for  the  Chemical  Abstracts  database,  the  concept  of  non-aerospace  data- 
bases was  Introduced  calling  for  "sponsorship".  The  sponsor,  who  could  be  the  database  supplier  or 
an  interested  user  group  was  required  either  to  pay  to  ESA  the  full  cost  of  file  creation  and 
subsequent  on-line  storage  and  updating,  or  to  guarantee  to  pay  the  difference  should  utilisation 
be  Insufficient  to  recover  costs.  In  the  case  of  Chemical  Abstracts  the  former  arrangement  was 
agreed. 

By  "the  Agency"  ti  meant  the  Agency's  ataif  at  iti  Head  OHice  in  Panes;  and  ita  eataUiahmenta  at  fSTBC 
Hooaduiijk;  and  ESOC,  Pa/uMtadt;  alao  ita  fvUme  eontnactou,  theiJi  co-cantaactoKa,  and  actentific  gnoups 
aaaociated  uiith  ESA  actentiiic  miaaiona. 
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A.  Databases  chosen  Initially 

to  augment  the  NASA  File 

COMPENDEX 

Engineering  Index 

NTIS 

US  Government  Report  Announcement 

METAOEX 

Metals  Abstracts 

INSPEC 

Physics,  Electrical  & Electronic  Engineering,  Computers  and  Control 

B.  Databases  chosen  more  recently  to  augment  the  NASA  File 

HAA 

World  Aluminum  Abstracts 

ESI 

Environmental  Science  Index 

ISI 

Science  Citation  Index  (Physics,  Chemistry  & Engineering  sections) 

PASCAL 

Bulletin  Signaletique  (Aerospace-related  sections) 

C.  Non-Aerospace  Databases 

CA  CONDENSATES 

Chemical  Abstracts  Service 

2.5  The  physical,  chemical  and  engineering  sections  of  the  ISI  Science  Citation  Index  were  brought 
on-line  during  May  1976.  Over  560,000  references,  from  June  1972  were  made  available.  This  file 
will  be  updated  each  month  under  the  terms  of  a 15  month  experiment  during  which  the  extent  to 
which  SOS  users  augment  traditional  searches  by  citation,  and  search  ISI  by  title-word  or  author, 
will  be  evaluated. 

2.6  The  aerospace  and  related  sections  of  the  French  PASCAL  database  (Bulletin  Signaletique)  were 
scheduled  to  be  on-line  by  mld-1976.  Interest  In  this  database  stems,  not  only  from  the  widespread 
French  access  to  SOS  now  possible  via  the  CYCLADES  network  and  from  Belgium  and  Switzerland;  but 
general  Interest  has  been  expressed  In  this  database  by  many  Member  States  as  a result  of  Its  use- 
ful coverage  of  the  literature  of  many  East-European  countries,  the  Soviet  Union,  and  some  Arab 
countries.  SOS  Is  particularly  Interested  In  the  promised  tri-lingual  version  of  PASCAL,  In 
French,  German,  and  English. 


Fig.  1 KSA  RECON  Member  States  utilisation  trends  1975-76 
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2.7  Accurate  detenal nation  of  the  off-line  capacity  available  for  file  updating  Is  a rather  coapicx 
matter  In  terms  of  the  finite  number  of  databases  whose  retro-file  sites,  update  file  sizes, 
record  sizes,  and  record  complexities  bedevil  any  attempt  to  achieve  a standard  approach.  SOS  has 
begun  to  consider  such  projections,  however,  since  a knowledge  of  the  optimum  database  content  Is 
becoming  necessary  to  proper  planning  decisions.  The  "unit  database*  Is  taken  to  be  about  a half 
million  Items  on-line  updating  at  around  5,000  Items  per  month.  First  order  estimates  Indicate 
capacity  equivalent  to  some  25  to  30  unit  databases,  and,  allowing  for  a degree  of  miscalculation 
and  the  necessary  contingencies,  would  Indicate  an  available  capacity  for  sotae  8 to  10  additional 
unit  databases. 

Figure  2 shows  the  Influence  on  unit  costs  of  file  maintenance  as  more  databases  are  added  within 
available  capacity.  Optimisation  of  this  capacity  will  result  In  a lowering  of  existing  prices  to 
users  and  should  also  contribute  to  Increasing  system  load  factor  thus  Increasing  the  centre's 
turnover. 


NUHBER  OF  UNIT  DATABASES 


Fig.2  Variation  of  database  component  of  Tile  connect  fee  with  number  of  unit  databases  maintained 


3.  SOS  CHARGING  POLICY 

3.1  SOS  has  always  been  required  to  charge  for  services  to  Member  States  users.  Even  for  the  first 
year  of  pilot  operation  In  1965/66  EUROSPACE  participated  In  the  financial  guarantee  required  by 
the  contractor.  At  this  time  however,  the  reason  was  simply  to  prevent  the  annual  costs  of  the 
contract,  which  was  then  the  min  element  of  the  SOS  budget,  from  exceeding  budgetary  provisions. 
Nowadays  the  objective  has  changed  somewhat  and  Is,  quite  simply,  to  recover  that  part  of  the 
total  cost  of  the  ESA  RECON  on-line  search  service  attribute  to  external.  Member  States  use. 

3.2  The  Initial  conditions  of  access  and  prices  to  be  charged  for  the  early  batch  search  services 
carried  out  under  contract  were  formulated  by  the  precursor  of  the  ESRO/ELDO  Documentation 
Consultative  Committee  (DCC),  and  were  approved  by  the  ESRO  Administrative  and  Finance  Comlttee 
(AFC),  during  early  1965.  Under  Its  terms  of  reference  the  DCC  was  required  to  advise  on  the 
prices  to  be  charged,  and  the  Comlttee  has  always  been  consulted  on  all  mtters  relating  to 
charging  policy,  whether  questions  of  principle  or  the  details  of  actual  price  levels. 

3.3  It  was  In  1970  that  the  Director  General's  policy  statement  mentioned  for  the  first  time  the  pos- 
sibility of  placing  SOS  services  to  Member  States  on  a self-financing  basis  and  modest  success  In 
earning  Income  during  that  year  led.  In  1971,  to  the  Introduction  of  the  policy  of  cost-recovery. 
However,  by  early  1972  the  D.G.,  said  that  he  thought  It  necessary  to  recognise  that  If  the 
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strvice  «wre  restricted  to  the  aerospace  coannlty  It  would  never  be  properly  econoalc. 

3.4  Althounh  the  first  external  tenalnal,  at  TRC  near  London,  had  been  In  operation  since  the  begin- 
ning of  1971,  and  two  others  at  CNES  Bretigny,  and  ZLDI  Munich,  were  Installed  during  that  year, 
SOS  notions  of  charging  policy  were  as  yet  rudiwentary  and  the  Initial  financial  arrangenents 
were  of  necessity  soaewhat  ad  hoc. 

By  the  beginning  of  1972,  however,  a working  group  set  up  by  the  DCC  had  begun  to  examine  the 
problem  and  had  already  noted  that  the  practice  of  Member  States  paying  the  full  cost  of  the 
leased  telephone  line  from  ESOC  to  the  national  documentation  centre  was  inequitable  since  those 
countries  located,  by  geographic  accident,  furthest  away  were  penalised.  Based  on  the  limited 
experience  then  available  It  seemed  that  external  users  with  the  possibility  of  serving  a reason- 
ably large  clientele  (e.g.,  national  or  regional  documentation  centres,  and  major  Industrial 
groups)  were  prepared  to  pay  about  20,000  AU  as  an  annual  fee  to  ESRO  In  addition  to  local  line 
and  equipment  costs.  On  this  tentative  basis  the  cost  recovery  gap  (then  estimated  at  about 
400,000  AU  pa)*  could  theoretically  be  closed  If  the  SOS  network  could  Include  about  20  external 
leased-llne  terminals  (LLT)  In  addition  to  about  5 ESRO  terminals;  this  was  thought  to  be  techni- 
cally feasible. 

3.5  At  Its  meeting  In  March  1972  the  DCC  took  Its  first  decision  relating  to  the  charging  policy  for 
direct  on-line  access  to  the  ESRO  RECON  service,  and  approved  charges  of:  11,000  AU  pa  - rent  and 
maintenance  of  terminal  (and  allocation  of  central  resources),  and  22,500  AU  pa  - ESRO  SOS  con- 
nection fee. 

For  this  total  of  33,500  AU  (or  about  $ 40,000)  SOS  was  prepared  to  guarantee  a minimum  of  1200 
hours  RECON  availability,  I.e.,  a unit  cost  of  about  $ 33  per  hour.  These  figures  assumed  that 
ESRO  would  provide  a tele-coamunlcatlons  network  reaching  Into  each  Member  State  requesting  the 
RECON  service,  whilst  the  terminal  was  a sophisticated  device  utilising  a video  display  at  240 
char,  per  sec.,  and  Incorporating  a low-speed  printer.  The  central  database  then  comprised  four 
files  totalling  almost  one  million  bibliographic  Items. 

3.6  The  Identification  of  potential  customers  In  Europe  able,  and  willing,  to  spend  lump  sums  of 
such  aagnltude  for  an  Information  service,  however  advanced,  was  an  uphill  task,  and  by  the  fol- 
lowing year  SOS  had  produced  proposals  for  the  sharing  of  the  connection  fee  over  two  or  more 
terminals.  In  an  attempt  to  lower  the  entry  fee  to  RECON.  Though  this  option  did  enable  a group 
of  Industrial  companies  In  the  Netherlands  to  gain  direct  access  to  SOS,  In  general  these  cost- 
sharing schemes  were  unsuccessful;  what  was  needed  was  a ’pay-as-you-go"  system. 

3.7  Following  the  move  of  SOS  to  ESRIN,  Frascati  In  1973,  the  Installation  of  an  IBM  360/50  computer 
dedicated  to  RECON  support  and  associated  tasks,  and  anticipating  the  Introduction  of  new  com- 
puter statistics  and  accounting  routines  during  1974,  the  concept  of  hourly  file  connect  fees 
had  been  explored  by  SOS  and  a tentative  scale  of  prices  derived  ranging  from  28  to  38  AU  ($  34 
to  46)  which  could  be  applicable  to  both  LLT  and  TTY  terminals.  These  prices  were  approved  by 
the  DCC  In  February,  1974,  and  were  Increased  slightly  at  the  end  of  1975  (see  appendix  3). 

3.8  In  deriving  these  early  file  connect  fees  SOS  did  not  then  have  detailed  statistics  of  utilisat- 
ion nor  breakdowns  of  total  operating  expenditure  In  the  form  needed.  It  is  remarkable,  and 
probably  coincidental,  that  In  the  subsequent  refining  of  the  early  costs,  they  were  not  changed 
significantly.  The  procedure  followed  was  first  to  calculate  the  actual  cost  to  SOS  of  maintain- 
ing a given  file.  Including  lease  fees,  programmer's  time,  computer  time,  mass  storage,  etc., 
then  to  estimate  the  probable  proportion  of  total  on-line  connect  time  spent  on  the  file;  hence 
the  hourly  rate.  No  coament  Is  needed  on  the  primaeval  forecasting  techniques  then  available, 
whilst  the  derivation  of  costs  left  much  to  desired. 

3.9  Forecasting  Is  now  based  on  detailed  statistics  of  file  utilisation  (see  appendices  1,  2 and  4) 
which  enable  us  to  plot  accurate  trends  and  to  project  these  with  considerably  more  confidence 
(Fig.  1).  Paradoxically  we  have  moved  away  from  the  early  attempts  at  detailed  cost  accounting 
to  derive  database  maintenance  costs,  and  now  favour  a simple,  top-down  approach.  Since,  If  a 
file  Is  kept  on-line  but  never  used,  none  of  the  network  resources  and  virtually  none  of  the 
central  computing  on-line  resources  are  used,  the  costs  actually  Incurred  will  be  those  asso- 
ciated with  mass  storage  and  with  periodic  off-line  file  updating.  These  database  related  costs 
are  Identified  as  part  of  the  total  annual  operating  costs  and  are  allocated  across  the  various 
databases  on  the  basis  of  total  file  size,  update  file  size,  average  record  size,  record  com- 
plexity, and  the  number  of  Inverted  file  entry  points.  Since  we  may  In  future  Introduce 
additional  processing  as  part  of  file  updating  this  will  also  have  to  be  accounted  for.  Current 
SOS  prices  are  shown  at  appendix  3,  and  an  example  of  a computer-prepared  monthly  Invoice  for 
an  LLT  user  Is  shown  at  appendix  5.  Note  that  the  amount  in  AU  Is  converted  to  the  member 

Uhitst  opeAoting  RECON  on  the  ESOC  computeA  (U  a modeAote  pAcoAlty  task. 


country's  national  currency  of  payment.  A separate  statement  of  royalties  incurred  is  prepared 
(appendix  7)  showing  an  amount  in  US  dollars  (for  all  databases  to  date). 
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3.10  Up  to  now  the  figure  of  3S.OOO  to  40,000  connect  hours  annually  has  been  the  basis  assumed  for 
cost-recovery  and  all  file  prices  are  calculated  on  this  basis.  By  the  end  of  1975  installed 
network  capacity  had  reached  about  35,000  hours,  10,000  of  which  was  used  by  the  Member  States, 
and  6,000  by  ESA  and  SOS.  By  the  end  of  1976  network  capacity  will  comfortably  exceed  100,000 
hours.  Assuming  that  the  load  factor  does  not  change  significantly  some  30,000  hours  of  Member 
States  use  could  result.  Ue  believe, however,  that  the  capacity  of  the  SOS  central  computer  is 
greater  than  100,000  hours;  if  we  could  push  utilisation  up  towards  this  figure  there  is  a 
tremendous  scope  for  really  significant  price  reductions.  An  important  factor  in  achieving  this 
goal  will  be  the  addition  of  more  databases  to  increase  the  load  factor. 

3.11  As  the  concept  of  networking  and  inter- networking  gains  ground  and  is  realised  operationally  we 

shall  see  a move  away  from  a small  number  of  centralised  on-line  information  centres  to  a 
proliferation  of,  perhaps  smaller  or  more  specialised,  centres.  It  is  to  be  hoped  that,  in  the 
complicated  planning  of  such  information  complexes  the  optimisation  of  the  indi- 

vidual centres  will  not  be  overlooked  in  the  attempt  to  optimise,  at  a higher  level,  the  overall 
system  represented  by  the  networks  and  interconnected  hosts.  If  they  are  overlooked  it  is  conceiv- 
able that  any  economies  iwhich  might  have  been  realised  as  a result  of  physical  networking  will  be 
offset  by  the  prices  charged  by  the  hosts  in  their  attempts  to  achieve  financial  viability. 

As  an  extreme  example  of  what  could  happen,  current  SOS  prices,  which  are  expected  to  result  in 
operating  cost-recovery  quite  soon,  range  from  32  AU  to  56  AO  ($  42  to  73)  per  connect  hour , the 
lowest  price  being  for  the  NASA  File.  If,  as  has  been  suggested  quite  seriously,  SOS  were  to 
remove  all  databases  other  than  the  NASA  File  we  calculate  that  it  would  be  necessary  to  charge 
135  AU  per  hour  ($  176)  to  recover  costs.  In  fact,  of  course,  such  a price  would  be  counter- 
productive and  in  practice  it  could  prove  quite  Impossible  to  become  financially  viable.  * 

Conversely,  If  more  databases  are  added  unit  costs  of  file  maintenance  will  reduce  whilst  system 
load  factor  can  be  expected  to  increase.  Under  these  conditions  It  Is  conceivable  that  existing 
prices  could  be  halvedl 

3.12  The  introduction  of  the  RTC  (remote  terminal  concentrator)  by  SOS  recently,  permitting  access  by 
low-speed  HY  data  terminals,  has  given  rise  to  new  aspects  of  charging  policy.  The  arrangements 
approved  by  the  OCC  were  that  the  national  centre  Installing  an  RTC  shares  the  hardware  and 
Installation  costs  equally  with  SOS,  (i.e.,  a non-recurring  cost  of  10,000  AU);  and  provides 
local  telephone  lines  and  low-speed  modems.  The  centre  issues  contracts  to  its  users,  handles 
Invoices  (prepared  centrally  by  SOS,  see  appendix  6),  and  is  responsible  for  promotional  and 
training  activities  (though  the  latter  are  supported  by  SOS,  to  the  extent  possible,  particularly 
during  the  first  year). 


4.  THE  INTERNATIONAL  REQUIREMENTS 

4.1  ESA  currently  has  ten  full  Member  States,  and  four  Observer  States  **  two  of  which  are  partici- 
pating In  a special  ESA  programne.  In  addition  SOS  also  has  contact  with  other  non-Member  States. 

In  all  some  nine  different  European  languages  (not  to  mention  cultures)  are  involved!  This  gives 
rise  to  the  need  for  multi -lingual  subject  specialists  to  abstract  and  index  documents  destined 
for  announcement  in  NASA  STAR.  Ue  ensure,  however,  that  these  people  do  not  spend  more  than  a 
proportion  of  their  time  on  input  processing  tasks,  leaving  them  free  also  to  carry  out  user 
liaison  functions.  These  activities  are  diverse  and  include  a particularly  Important  ESA  RECON 
advisory  and  training  role,  first  when  a Member  State  is  considering  which  new  services  it  could 
introduce  nationally  based  on  SOS,  and  secondly  in  support  of  promotional  and  training  seminars  in 
connection  with  dial-up  access.  Language  barriers,  though  potential  obstacle  have  never  in  fact 
presented  a serious  difficulty  to  SOS.  With  the  establishment  of  SOS  "windows*,  i.e.,  national 
centres,  in  Member  States  the  problem  tends  to  recede  even  further. 

4.2  The  GNP  and  population  of  ESA  Member  States  vary  significantly,  resulting  in  widely  varying 
requirements  in  the  level  of  services  needed.  This  spectrum  of  need  has  led  to  the  early  instal- 
lation, followed  by  multiplication,  of  leased-line  terminals  giving  a very  high-level  service  in 
some  countries,  whilst  others  until  very  recently  could  not  Justify  on-line  access  at  all.  However, 
the  availability  of  dial-up  access  at  very  low  cost  now  enables  all  countries  to  gain  access. 

* aee  3.3 

**  ktAVuA,  Canada,  Vuland,  and  Hoamt/.  The  Repubtcc  oj  luJtand  signed  the.  ESA  convention  in  Vecembea, 

1973  and  Mitt  become  the  eleventh  membea.  of  the  Agency  once  the  new  Convention  it  aatified  by  all 
exitting  mejnbea  itatea. 


An  extreiwly  laportant  area  of  responsibility  for  the  DCC  has  been  to  advise  on  the  projection 
of  SOS  services  throughout  the  Neiiber  States.  Almost  Inevitably  this  led  a number  of  Members  to 
become  Instrumental  In  Introducing  Into  their  countries  direct  on-line  access  to  the  SOS  computer. 

It  will  be  realised  that  as  SOS  services  developed  and  extended,  the  tasks  of  the  Individual  DCC 
members  became  progressively  more  onerous,  as  they  were  called  upon  to  reconcile  the  sometimes 
conflicting  policy  and  financial  requirements  of  (ESRO)  ESA  with  those  of  their  own  national  ser- 
vices. More  recently  the  situation  has  been  complicated  further  by  the  emergence  of  the  EURONET 
concept  within  the  Cotimission  of  the  European  Communities,  which  has  called  for  the  harmonisation 
of  national  Information  policies  In  the  CEC  member  countries  with  the  International  plans  of  the 
CEC  Itself,  and  with  those  of  ESA  whose  member  countries  are  not  wholly  coincident  with  those  of 
the  CEC:  Small  wonder,  then,  that  full  agreement  cannot  be  reached  overnight. 

4.3  It  has  been  the  experience  of  SOS  that  the  DCC  has  functioned  quite  exceptionally  well,  never 
lacking  the  necessary  goodwill  and  understanding  to  achieve  joint  agreement  and  has  been  a very 
positive  example  of  how  international  cooperation  can  result  in  the  realisation  of  a common,  cost- 
effective  resource.  Following  the  creation  of  ESA,  and  recognising  that  the  extent  of  SDS  service 
throughout  Europe  must  have  a significant  impact  on  the  European  information  environment,  the  DCC 
was  superseded  in  December,  1975,  by  the  ESA  Documentation  Advisory  Group  (DAG),  a delegate  body 
which  met  for  the  first  time  in  June  of  this  year. 

4.4  The  first  network  interconnection  undertaken  by  SDS  was  the  unidirectional  link  with  TYMSHARE 
which  enabled  TYMNET  users  to  access  the  SDS  databases  at  Frascati  and  provided  the  first 
experience  of  dial-up  access  to  ESA  RECON.  It  is  interesting  to  note  the  policy  adopted  by  one 
member  country,  Switzerland,  which  has  opted  to  use  the  Lausanne  TYMNET  node  as  the  Swiss  national 
access  point,  monitoring  traffic  density  to  provide  data  on  which  to  base  any  future  decision. 

In  contrast  the  decision  of  France  was  to  connect  the  French  national  CYCLADES  network  with  SDS 
and  this  network  now  provides  the  approved  means  of  access  to  SDS  for  French  users,  though  some 
established  TYMNET  users  in  France  insist  on  maintaining  their  use  of  this  latter  route  at  pre- 
sent. Similar  interconnection  is  now  under  development  with  the  CNUCE  Italian  Universities  Net- 
work, whilst  possible  interconnections  with  the  Spanish  CTNE  and  Nordic  SCANNET  networks  are 
under  discussion. 

A dialogue  has  long  been  in  existence  between  the  Conmission  of  the  European  Communities  and  ESA 
and  has  resulted  recently  in  the  drafting  of  letters  of  agreement  relating  to  scientific  and 
technical  Information  policy  in  the  two  organisations.  SDS  is  scheduled  to  become  one  of  the 
first  two  hosts  to  be  connected  to  EURONET,  the  other  being  the  Deutsches  Institut  fflr  medfzinische 
Dokumentatlon  und  Information  (DIMDI)  at  Cologne.  It  has  been  established  that  the  minimum  service 
to  be  provided  through  EURONET  from  ESA  will  be  based  on  those  databases  on-line  from  SDS  at  the 
date  of  the  first  operational  service  through  EURONET. 

4.5  Data  communications  pose  special  problems  in  Europe,  not  only  because  of  the  very  high  cost  of 
line  rentals  as  compared  with,  say,  the  United  States,  and  the  heavy  surcharges  imposed  should 
the  user  be  able  to  improve  the  efficiency  of  line  utilisation  by  operating  at  higher  speeds,  but 
also  because  of  the  problems  involved  in  attempting  to  co-ordinate  the  efforts  of  so  many  (cur- 
rently 11  for  SDS)  autonomous  P.T.T.'s,  each  of  which,  in  spite  of  CEPT,  offers  services  of  dif- 
fering quality.  Interprets  conmon  standards  in  the  most  unconmon  manner,  and  conmunicates  in  a 
different  language,  no  one  of  which,  it  appears  at  times,  is  an  adequate  basis  for  communication 
between  any  two  P.T.T.'s! 

The  international  high-speed  trunks  between  Frascati,  Italy,  and  Darmstadt,  Germany,  are  absolutely 
vital  to  RECON  operation,  carrying  the  bulk  of  network  traffic.  Overall  availability  of  these 
trunks  is  however,  no  better  than  85  per  cent!  This  is  about  the  best  than  can  expected  from 
international  leased  data-circults  in  Europe  at  present.  When  central  computer  and  peripheral 
down-time  is  superimposed  on  data-circuit  outages  the  net  ESA  RECON  system  availability  seen  at 
the  user's  terminal  is  no  better  than  80  per  cent.  Thus  fact  has  prompted  the  commissioning 
recently  of  a study  of  the  reliability  of  the  entire  ESA  RECON  system.  The  study  is  intended  to 
highlight  the  areas  where  improvements  must  be  made  to  realise  the  required  level  of  95  per  cent 
reliability  at  the  remote  terminals,  and  should  identify  not  only  the  faulty  or  unreliable  com- 
ponents of  the  system,  but  also  the  reasons  for  the  failures. 


5.  ADVANCES  IN  BHRIEVAL  TECHNOLOGY 

5.1  It  is  Interesting  to  consider  the  recent  trends  In  Information  use  as  Indicated  by  the  growth  of 
on-line  searching.  Figure  3 shows  the  trend  from  1971  to  1974  In  the  United  States  (figures  due 
to  Prof.  Martha  Williams)  and  In  Western  Europe  (EUSIDIC).  The  U.S.  curve  would  appear  to  be 
exponential.  Figure  4 Includes  two  projections  of  use  In  Europe,  one  obtained  by  superimposition 


of  the  points  of  origin  of  the  Martha  Williams  curve  over  that  for  EUSIDIC,  the  |econd  was  pre 
pared  by  PA  Management  Consultants  under  a contract  with  the  European  Commission  . They  may  be 
considered  to  be  the  result  of  optirolstic/pessimistic  assumptions  but  whichever  Is  nearer  the 
truth,  explosive  growth  In  utilisation  Is  Indicated. 
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Fig.3  Growth  of  interactive  on-line  bibliographic  searches  in  the  USA  and  Europe;  trends 
(USA  Professor  Williams;  Europe  EUSIDIC) 
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Fig.4  Growth  of  interactive  on-line  bibliographic  searches  in  the  USA  and  Europe:  trends  and  projections 
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It  has  been  said  that  ‘the  future  of  the  knowledge  Industry  and  all  Its  components,  without 
exception.  Is  closely  related  to  the  automation  of  Information.  The  unanimous  view  (of  Delphi 
surveys)  Is  that  during  the  decade  1980  • 1990  automated  information  will  entirely  replace  the 
more  or  less  adequate  manual  processes  at  present  transmitting  and  disseminating  knowledge". 
Further:  “The  evidence  shows  that  the  experts  have  not  even  begun  to  give  any  serious  consider- 
ation to  more  complex  applications,  or  to  types  of  services  which  may  become  quite  coninon  In  the 
future.  It  would  seem  that  barely  ten  years  after  the  first  automated  Information  systems  were 
Introduced,  we  are  already  prisoners  of  a fixed  line  of  thought." 


Can  we  meet  this  challenge?  In  common  with  most  other  Information  centres  SDS  currently  treats 
each  database  as  a discrete  Information  source.  The  user,  who  may  need  to  search  several  data- 
bases In  order  to  obtain  an  exhaustive  answer,  must  develop  a different  search  statement  for  each 
database,  making  allowances  for  the  differing  vocabularies.  Indexing  philosophies,  or  possibly  a 
lack  of  Indexing!  The  process  Is  time-consuming.  Inefficient,  and  frustrating  to  the  searcher  who 
runs  the  risk  of  falling  to  select  the  one  database  which  would  be  most  productive.^  It  Is  also 
entirely  artificial  since  there  are  simply  not  that  many  different  kinds  of  Information  (over 
400  computer  readable  databases  have  been  reported  recently^).  The  artificial  nature  of  this 
approach  Is  well  Illustrated  by  the  degree  of  overlap  which  exists  between  many  secondary  ser- 
vices. 


Contractual  arrangements  with  the  database  suppliers,  who  tend  to  charge  royalties  on  the  basis 
of  file  connect  time  and  are  also  understandably  concerned  to  see  that  the  often  unique  features 
of  their  database  should  not  be  lost  or  diluted  In  an  on-line  system,  are  partly  responsible  for 
the  current  unsatisfactory  state  of  affairs. (The  Information  centres  themselves  must  also  take 
their  share  of  responsibility,  however,  since  none  has  yet  produced  a solution. 


5.5 


The  need  for  the  development  of  Improved  techniques  for  the  simultaneous  searching  of  many 
bibliographic  databases  has  long  been  recognised  by  SOS.  However,  earlier  studies  have  served 
only  to  emphasise  the  difficulties  rather  than  to  suggest  possible  solutions.  All  approaches 
tended  to  be  based  on  the  notion  of  cross-linking  thesauri,  a notion  which  Is  operationally 
unattractive. 


5.6  During  1975  SOS  Initiated  an  experimental  project  as  part  of  Its  Investigation  of  the  Integral 
Information  base  concept.^  This  concept  Is  based  on  the  following  requirements: 


- each  source  database  as  received  from  the  supplier  will  be  maintained  as  a discrete  datafile 
on  the  computer,  retaining  fully  Its  Identity .and  any  unique  characteristics 


- each  source  database  may  be  searched  Individually  If  the  searcher  so  wishes 

t 

- the  user  will  see,  through  his  remote  terminal  window,  as  default  option,  an  Integral  Infor- 
mation pool  (Irrespective  of  source  databases)  which  may  be  searched  by  the  development  of 

a single  search  statement  on  a common  Inverted  file 


- the  user  may  also  select  one  or  more  discrete  databases  prior  to  searching,  I.e.,  any  subset 
of  the  total  Information  base. 


Software,  both  off-line  and  on-line,  already  exists  to  support  the  Integral  Information  base 
approach,  but  to  be  realised  In  practice  a common  retrieval  vocabulary  for  all  merged  files  Is 
necessary.  In  addition  to  this  common  vocabulary  each  database  would  be  augmented  with  any  spe- 
cial retrieval  keys  which  may  be  provided  by  the  database  supplier.  A developed  forrn^  of  free- 
term  Indexing  could  provide  this  connon  retrieval  vocabulary. 


5.8 


This  developed  form  of  free-term  Indexing  presupposes  Improvements  In  three  Important  areas. 
First,  that  the  Inverted  file  would  be  derived  using  a machine-aided  approach  In  order  to 
eliminate  debris,  correct  misspellings,  amend  spelling  variants,  standardise  abbreviations,  and 
minimise  the  singular/plural  problem.  Second,  that  term  phrases  would  be  Introduced,  derived 
again  via  a machine-aided  approach,  and  based  on  some  variant  of  siginificance  analysis.  Third, 
that  some  form  of  search-aid  would  be  developed,  comparable  with  the  Related  Terms  feature  of  a 
controlled  vocabulary,  called  Associated  Concepts.  The  derivation  of  Associated  Concepts  could 
be  expected  to  be  on  the  basis  of  statistical  associations  of  terms  actually  existing  In  a large 
database. 


5.9 


As  the  first  phase  of  this  experimental  project  SDS  wished  to  develop  all  the  necessary  techniques 
on  a single  database,  and  preferably  the  one  with  which  we  were  most  familiar.  NASA  obliged  by 
providing  two  years  of  NASA  STAR  abstracts  tapes.  The  experimental  database  Is  already  on-line, 
and  the  Initial  Inverted  file  of  title  and  abstract  terms  has  been  completely  Intellectually 
processed  and  fed  back  to  form  a Reference  Term  File  which  will  be  the  basis  of  the  on-line  ^ 
Inverted  file.  A technique  based  on  the  notion  of  extentlonal  relationships  developed  by  Wessel 
Is  being  used  to  derive  a variety  of  possible  formats  for  the  Associated  Concepts  (RT  analogue) 
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search-aid.  We  may  not  be  able  to  Incorporate  term  phrases  In  this  first  phase.  A group  of 
experienced  ESA  RECON  users  are  being  asked  to  evaluate  the  resulting  capability,  under  the 
direction  of  an  external  co-ordinator  who  will  also  analyse  the  results  of  test  searches  and 
produce  a final  report.  We  shall  thus  obtain  an  objective  assessment  of  the  work  to  date  from 
an  expert  In  the  field. 

If  the  results  Indicate  that  this  line  of  development  Is  worth  following  we  then  hope  to  repeat 
the  exercise  on  a second  database  when  It  will  be  Immediately  possible  to  test  the  full  range 
of  Integral  Information  base  features.  The  results  could  then  lay  the  basis  for  a new  operat- 
ional service  from  SOS. 

5.10  The  conversion  of  an  experimental  success  Into  a useful  operational  service  would  probably  be 
dependant  on  two  main  factors.  One  would  be  the  willingness  of  the  database  suppliers  to 
jointly  seek  alternative  but  equitable  bases  for  the  recovery  of  royalties  or  use  charges. 
However,  If  the  technique  can  be  shown  to  be  a significant  Improvement  In  service  to  the  ulti- 
mate user  we  do  not  believe  that  the  database  suppliers  would  wish  to  block  such  progress. 

The  other  factor  would  be  the  extent  to  which  the  Integral  Information  base  concept  would  be 
compatible  with  the  Increasing  demand  for  multi -lingual  systems.  We  believe  that  a feasible 
approach  would  be  to  have.  Initially,  titles  available  In  all  system  languages  and  to  create 
Independant  Inverted  free-term  files  together  with  auxiliary  search  aids  for  each  system 
language.  It  Is  felt  that  availability  of  the  search  vocabulary  alone  (whether  controlled  or 
not)  Is  insufficient  since  interaction  with  the  database,  so  vital  to  effective  on-line  search- 
ing, would  be  Inhibited  If  titles  were  not  available  In  the  searcher's  own  language.  This 
approach  would  provide  an  Innedlate  multi -lingual  capability.  The  overheads  In  additional  mass- 
storage  could  later  be  reduced  as  Improved  techniques  were  developed. 

5.12  I leave  you  to  judge  If  all  Information  centres  are  necessarily  "prisoners  of  a fixed  line  of 
thought",  and  to  consider  whether  the  apparent  level  of  effort  currently  allocated  to  the 
development  of  new  techniques  In  automated  Information  retrieval  Is  going  to  be  sufficient  to 
enable  us  to  satisfy  an  exponentially  Increasing  demand  for  an  exponentially  increasing  supply 
of  Information  In  all  sectors  and  at  all  levels. 
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Example  of  iumxULy  monthly  itcuUhtuu  of  ESA  RECOM  database  wUtuation 


JPACt:  DOCIIME  N lAP  ION  SfcltVtCfc 
lAY  , It, 

fOTAL  CONNfcCr  TIME  ; 1691H  IM 

riJSTOMER  CONNECT  TIME  : 


ILE 

TIME 

i 

0 

OH 

2M 

0.0 

1 

202H 

S2M 

25.8 

2 

200H 

26  n 

25.6 

3 

34H 

16M 

4.4 

4 

77H 

38M 

9.9 

5 

111 

20M 

0.2 

6 

79H 

78 

lOiO 

7 

14H 

33M 

1.8 

U 

126R 

40M 

16.1 

9 

5H 

54H 

0.7 

10 

5H 

9B 

0.7 

11 

10H 

8H 

1.3 

14 

OH 

in 

0.0 

17 

20H 

16H 

2.6 

19 

OH 

I7n 

0.0 

32 

8H 

34N 

1.1 

782H 

•>in 

100.0 

1 


f 

% 
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I 

« 


Exampfe  o6  monthly  hwmoKy  itatutica  oi  ootAoJU  ESA  KECON  iyitem  uUUitUion  by  teaied-line  and 
dial-up  teAminali. 


$h>ACE  COClIPilMTM'KiN  mJVlCk 
(NUHbBBS  IN  l‘r:l<CEN'l') 


MAY  1976 


UASED  LINE  ThAFFIC 


>OBT 

LOGONi. 

liHFAKS 

CONHANUS 

INVALID 

CONNECT  TINE 

AO  SPAIN 

1 1.2 

b.U 

7.4 

8.5 

4.2 

40  ROME 

3.0 

10.6 

9.3 

2.8 

13.6 

4 3 ROME 

2.7 

10.6 

1.6 

1.0 

9.  1 

A7  GERMANY-SCANDINAVIA22.0 

22.9 

28.8 

50.  1 

26.5 

A1  NETHERLANDS 

20. « 

15.6 

15.9 

11.0 

14.3 

»6  GREAT  BRITAIN 

20. S 

9.2 

16.  1 

12.6 

9.2 

A3  FRANCE 

b.U 

17.9 

7.3 

9.1 

15.7 

AS  MOROCCO 

3.5 

4.  1 

5.4 

3.3 

3.5 

A 2 BELGIUM 

3.7 

1.U 

2.0 

0.9 

1.2 

44  ROME 

3.7 

2.3 

6.2 

0.7 

2.7 

TOTAL 

58.3 

81.6 

76.2 

86.6 

82,5 

TYHSHARE  TRAFFIC 


PONT 

LOGONS 

BREAKS 

COHHANDS 

INVALID 

CONNECT  TIME 

6A  ) 

79.0 

100.0 

82.3 

79.5 

82.8 

68  I 

17.7 

0.0 

15.9 

15.9 

15.5 

6C  J 

3.2 

0.0 

1.8 

4.5 

1.7 

TOTAL 

4.0 

1.5 

2.1 

0.9 

1.2 

OlAL-IlP  TRAFFIC 


»OHT 

LOGONS 

BREAKS 

COHHANDS 

INVALID 

CONNECT  TIHE 

9A 

BRUSSELS 

0.9 

0.0 

0.3 

0.5 

0.3 

94 

DARMSTADT 

51.5 

40. C 

50.7 

37.3 

44.5 

97 

BRUSSELS 

0.2 

0.0 

0.0 

0.0 

0.0 

98 

, 

0.4 

0.0 

0.1 

0.0 

0.0 

99 

, 

0.4 

0.0 

0.2 

0.0 

0.1 

98 

ROME 

5.9 

6.7 

6.7 

8.6 

6.9 

95 

DARMSTADT 

9.0 

1 3.  3 

10.1 

7.0 

9.5 

62 

DARMSTADT 

1.4 

0.0 

1.9 

1.6 

2.0 

60 

ROME 

12.3 

26.7 

14.3 

25.9 

17.8 

64 

ROME-1 200 

9.2 

0.0 

7.8 

6.6 

8.6 

92 

ROME-1 0 

6.7 

13.3 

6.9 

8.  1 

9.4 

6E 

DATEX 

2.2 

0.0 

1.0 

2.2 

0.9 

TOTAL 

1 /.8 

5.6 

9.4 

3.8 

6.5 

O.T. 

C.  TRAFFIC 

30HT 

LOGONS 

Breaks 

COHHANDS 

INVALID 

CONNECT  TIHE 

A7 

STOCKHOLM 

4.7 

0.0 

3.1 

9.2 

A6 

LONDON 

60.4 

66.7 

76.  1 

80.9 

A4 

CYCLADES 

35.0 

33.3 

20.8 

9.9 

( 

I 

I 

* 


12.4 


» 


'TOTAL 


m.9 


11.2 


B.7 


9.8 


3.  PRINT  CHAKGE 

A fee  of  AU  0,05  will  be  charged  for  each  reference  printed 
"off-line"  (except  for  format  1). 


4.  OTHER  CHARGES 

i)  For  the  Chenical  Abstracts  Condensates  Flla  an 
additional  fee  of  AU  10.  per  hour  is  due  to 
Stlchting  Nederlandse  Inforaatle  Combinatie  (NIC). 

ii)  For  dial-up  access  via  the  SOS  network  exclusively,  a 
connect  fee  of  AU  6.  per  hour  is  applicable. 

iii)  For  dial-up  access  via  the  Tynshare  network,  a connect 
fee  of  AU  9.  per  hour  is  applicable. 

iv)  For  dedicated  terminals  an  equipment  rental  5 maintenance 
charge  of  AU  11.000  per  contractual  year  is  applicable. 
The  cost  of  the  domestic  lines  is  at  the  users'  charge. 


5.  PAYMENTS 

All  charges,  except  the  royalties,  will  be  payable  in  national 
currency  at  the  official  ESA  rate. 

For  1076  the  ESA  exchange  rate  of  1 AU  is 

Royalties  will  be  invoiced  and  payable  in  US  Dollars. 

Rental  of  equipment  shall  be  payable  in  advance;  other  charges 
will  be  invoiced  monthly. 


Invoices  shall  be  paid  within  one  month  of  the  data  of  issue 


Appejitdix  4 


Example  monthly  ESA  RECON  utltUation  KepoJit  pn.epoA.ed  ion  each  database  supptien. 


■ K --  ^;PA?E  DOCmENTHTION  '5EHVIC!'  - TOTil.  PILE  UTILISATION  FOR  RECON  USB 

PERIOD:  BAT  1976 

LS  1 NASA  REFERENCE  HO.:  76/ROT-OH 

DATE  : 1976/  6/31 

SA,  SCIFN.  f.  TECHN.  INF.  0«'?IC2 
•?rc<5  OF  INDt/STPT  AFFAIRS 
D TECHNILOOY  UTILIZATION 


SMINOTON  20SU6,  O.S.A. 

■TN;  NR  A. A.  PBSIBONE,  HEAD  DI3TR.  R EXC.  BATE  PER  HOUR:  OSS  0 


1 

1 2 

1 

3 

1 

4 

5 

( 

. N ). 

1 USER 

1 

1 

1 

USED  TIBE 

COHULATIfES  | 
1 

TOTAL 
IN  S 

) 

1 

102 

1 

"l 

25 

h 

24 

143 

k 

49  1 

0.0 

1 

133 

1 

1 

23 

h 

16 

114 

k 

17  1 

0.0 

1 

104 

1 

1 

4 

k 

51 

14 

h 

29  1 

0.0 

1 

137 

1 

1 

19 

b 

41 

96 

k 

S3  1 

0.0 

1 

109 

1 

1 

15 

h 

41 

59 

k 

56  1 

0.0 

1 

113 

1 

1 

0 

h 

0 

1 

k 

57  i 

0.0 

1 

111 

1 

1 

2 

b 

42 

18 

k 

47  1 

0.0 

1 

112 

1 

1 

2 

b 

5 

24 

k 

S3  1 

0.0 

1 

m 

1 

1 

3 

h 

29 

47 

h 

33  1 

0.0 

* 

116 

1 

1 

31 

h 

32 

145 

h 

18  1 

0.0 

' 
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Appendix  5 


ExonpCe  oi  computeA.  geneAoted  monthly  invoice  pnepoAcd  ioA  ESA  RECOM  tejued-tine  tvminat  meA. 


>55*  --  SPACE  OOCOPESTATIOM  SERVICE  - TOTAL  PILE  ACCESS  FOR  RECOR  OSE 

PERIOD:  3 HAT  1976  - 26  BAT  1976 

REPERBRCE  HO.:  76/ACC 

DATE  : 1976/  6/31 


VIA  DISPLAT 


1 1 

1 2 1 

3 1 

1 * 

1 

5 1 

6 

1 

1 FILES 

1 MO.  DBSCRIPTIOH 

1 USED  TIHE  1 
1 1 

COBOL  ATITES  I 
1 

1 OTHERS 

1 

1 FEB 
(HOUR 

PER  1 

IM  A.O.I 

TOTAL 

IH  A.O. 

1 

1 1 

H A S A 

13  h 

26  1 

165 

h 

28  1 

1 32 

1 

429.87 

1 

1 2 

CBEHICAL  ABSTR. 

3 h 

4 1 

16 

h 

15  1 

1 46 

I 

141.07 

1 

1 3 

BBTADEX  

0 h 

58  1 

7 

h 

14  1 

1 38 

1 

36.73 

! 

1 4 

COHPBHDEX  

1 A 

30  1 

17 

h 

47  1 

1 44 

1 

66.00 

1 

1 5 

BLBCTROHIC  CAP. 

0 h 

0 1 

3 

h 

24  1 

1 49 

1 

0.0 

1 

1 6 

MTIS  - GRA  .... 

7 h 

15  1 

69 

h 

44  1 

1 38 

1 

275.50 

1 

1 7 

OSAE  - HSA  .... 

0 h 

10  1 

3 

h 

26  1 

1 49 

1 

8.17 

1 

1 9 

IHSPBC  

3 h 

52  1 

35 

h 

48  1 

1 40 

1 

154.67 

1 

1 9 

V*  A*  A*  •••••• 

0 h 

35  1 

1 

h 

45  1 

1 40 

1 

23.33 

1 

1 11 

EHTIB.  SC.  IHD. 

0 h 

14  1 

1 

h 

29  1 

1 44 

1 

10.27 

1 

31  h 

4 

322 

h 

20 

TOTAL  : 1 

1145.60 

1 (»-o.) 

1 

671.46 

1 (I.SG) 

EXTRA 

FEB  FOR  FILE  2 

10  AO/HR 

30.67 

(A. 

0.1 

MOTES: 


1.  COHTHACT  HO.  : 13/73 

2.  OORATIOH  OF  COHTPACT  : FROB  75-11-15  TO  76-11-1* 

3.  HODRS  PORESBEH  IH  COHTRACT  : 0 

4.  CDBOLATITB  BOORS  OSED  OP  TO  26  BAT  1976  : 322  h 20  (TOTAL  COL.  3) 

5.  SPECIAL  CORDITIOHS  : 

6.  ICTOAL  BOORS  BILLED  AFTER  0 BOORS  : 322  h 20 

7.  BTCBAH6B  RATE  : 1 A. 0.  > 0.571  (LS3) 

8.  OTHER  COHBBHTS  :TO  BE  SBHT  TO 


••  VOTE  TO  FILE  2 : THE  HIC  BLBHBHT  OF  AO  10  - FOR  CAC  FILE  IS  DUE  TO  BSRO. 
PLEASE  PAT  TO  : 


« 


AppituUx  6 


Bxjompte.  oi  compute  genvuUed  monthly  -invo-Lct  p^poAed  ^oA  ESA  RECON  (Uat-up  o6eA. 


ESA  --  SPACE  DOCOHENTATIOH 
TESHINAL  HO.  403-0 

ATTN:  DTI  CONTRACT 

SBRTICE 

- TOTAL  FILE 
PERIOD:  3 
REFERENCE 
DATE 

ACCESS  FOR  RECON 
NAT  1976  - 28  HAT 
HO.:  76/ACC 

: 1976/  6/31 

USE 

1976 

VIA  RTC 

I 

1 

1 

2 

1 3 

1 9 

1 

5 

1 

6 

1 

1 

F I 1 E S 

1 OSED  TIRE 

1 COBOL  ATITES 

1 OTHERS 

FEE  PER 

1 

TOTAL 

1 

1 HO. 

DESCRIPTION 

1 

I 

HOUR  IN 

A.U. 

1 

IN  A.U. 

1 

1 1 

NASA 

1 

4 h 

20 

1 8 h 11 

32 

138.67 

1 

1 2 

CHBHICAL  ABSTR. 

1 

0 h 

17 

1 1 h 17 

46 

13.03 

1 

1 3 

RETADEX  

1 

0 h 

3 

1 3 h 42 

38 

1.90 

1 

1 9 

COHPBNDEX  

1 

0 h 

12 

1 0 h 35 

44 

8.00 

1 

1 6 

NTIS  - GRA  .... 

1 

0 h 

15 

1 1 h 19 

38 

9.50 

1 

1 7 

OSAB  - NSA  .... 

1 

0 h 

36 

1 0 h 36 

49 

29.40 

1 

1 8 

IHSPEC  

0 h 

9 

1 2 h 52 

40 

6.00 

1 

1 11 

ENVIR.  SC.  IND. 

1 

0 h 

0 

1 0 h 40 

btt 

1 

5 h 

52 

19  h 12 

TOTAL  : 

1 

207. 3C 

1 (A.U. 

1 

140.05 

1 (LSG) 

EXTRA 

PEE  FOR  RTC 

6 

AO/HR 

35.20  (A.O.) 

EXTRA 

FEB  FOR  FILE  2 

10 

AU/HR 

2.83  (A.O.) 

BOTES: 


1.  CONTRACT  NO.  : A6R. RENT 

2.  OaRATION  OF  CONTRACT  : FROB  76-  3-25  TO  0-0-0 

3.  HOORS  FORESEEN  IN  CONTRACT  : 0 

«.  CORDLATITE  HOORS  OSED  OP  TO  28  RAT  1976  : 19  h 12  (TOTAL  COl.  3) 

5.  SPECIAL  CONDITIONS  : 

6.  ACTUAL  HOURS  BILLED  AFTER  0 HOORS  : 19  h 12 

7.  EXCHANGE  RATE  : 1 A.O.  « 0.571  (LSG) 

8.  OTHER  OOHHINTS  : 


•*  NOTE  TO  PILE  2 : THE  NIC  ELEHENT  OF  AO  10  - FOR  CAC  FILE  IS  DUE  TO  ESSO. 


PLEASE  PAT  TO 


AppetuUx  7 


Example  oi  monthly  itatemoU  oi  KoyattU^,  ox  uie  choxyu,  incMxed  by  ESA  RECON  luex. 


r.Sk  -•  SPACE  DOCOHBMTATIOH  SERfICB 
TBRRINIL  HO.  117 


TOTAL  PILE  OTILISATIOH  FOB  RECON  OSE 
PERIOD:  3 HAT  1976  - 26  HAT  1976 
REFER BHCB  HO.:  76/ROT 

DATE  : 1976/  6/31 


ATTH; 


I 1 |2|314|S|6| 


I '=■  I L B S I OSSD  TIHE  | | | FEE  PIR  | TOTAL 

I NO.  DESCRIPTION  | | | |HOHR  IN  t | IN  S 


I 1 NASA  I 13  h 26  1 | I 0 I 0.0  | 

I 2 CHENICAL  ABSTR.  | 3 h 4 | | I 4 I 12. 27  | 

I 3 HETADEX | 0 h 58  | | I 2 | 1.93  | 

I 4 COHPENDEX | 1 h 30  | | | 3 | 4.50  | 

I 6 NTIS  - 6RA  ....  I 7 h 15  I | I 2 I 14.50  I 

' 7 OSAE  - HSA  ....  I 0 h 1U  I | I 0 I 0.0  | 

8 INSPBC ; 3 h 52  I | I 10  | 38.67  I 

I 9 H.  A.  A I 0 h 35  I | I 0 I 0.0  | 

I 11  ENTIR.  SC.  IND.  | 0 h 14  | | I 0 I 0.0  | 

sss===3pss«sxsssssss«ss  axssssssssssssssssassssxssssxssssssssssssvss  sassssasaaassssssss 

31  h 4 TOTAL  : I 71.87  | 


NOTES: 


1.  CONTRACT  NO.  : 13/73 

2.  DHRATION  or  CONTRACT  : FFON  75-11-15  TO  76-11-14 

3.  SPECIAL  CONDITIONS  : 

u.  OTHER  COHRTNTS  :TO  BE  SENT  TO 


PLEASE  PAT  IN  OSS  TO: 


ACCOUNT  HO.  013.50.11 
ESA/SDS 
BANCO  DI  ROHA 
FILIALE  DI  EON  A 
RONE  - ITAIT 


UNE  BANQUE  DE  DONNEES  DE  BIOMETRIE  HUMAINE 


A.M.  COBLENTZ  - Mattre  de  Conferences  i 
I'Universite  Rene  Descartes  (PARIS  V ) 

Oirecteur  du  Laboratoire  d 'Anthropologic 
et  d’Ecologie  Humaine. 


GENERALITES  ; L ' INFORMATION  BIOMETRIQUE  ET  LA  COSCEPTIOM  DES  EQUIPEMENTS  - 


La  conception  et  1 ' elaboration,  dans  tons  les  domaines  d'activites,  d'dquipe- 
ments  et  de  materiels  de  plus  en  plus  complexes  imposent,  dSs  les  tous  premiers  sta- 
des  des  etudes,  une  reflexion  sur  1 ' organisation  ergonoiaique  de  ces  equipements. 
Celle-ci  doit  tenir  compte  des  caracter istiques  des  populations  d'utilisateurs  appe- 
les  A les  servir  ou  A en  utiliser  les  services,  A des  echdances  connues,  plus  ou  moins 
lointaines . 


De  plus,  et  il  s'agit  lA  d'une  evolution  generals,  les  equipements  et  tout  par- 
t icul lerement  certains  postes  de  conduite  deviennent  de  plus  en  plus  complexes.  Bien 
souvent  ils  doivent  etre  utilises  dans  des  conditions  de  charge  de  travail  telles  pour 
1 ' ut il isa teur , que  si  les  etudes  de  relations  hommee-maohinea  ont  ete  peu  ou  mal  pri- 
ses en  compte  dAs  la  aonaeption,  cet  utilisateur  n'en  obtiendra  jamais  le  maximum 
d'efficacite  au  moment  critique.  Les  consequences  bien  entendu  depassent  largement  la 
simple  notion  d'utilisation,  puisqu'elles  peuvent  concerner  la  sCcurite  d'un  bomme  ou 
celle  d'un  equipage  et  compromettre  le  succAs  de  la  mission. 


Par  aille 
lisation  et  A 1' 
qui,  frAquemment 
quel  ils  ont  Ate 
materiel  ou  de  1 
ques  biomAtrique 
de  variability  d 


urs,  ces  memes  Aquipements 
exportation  et  de  ce  fait 
, ne  sont  plus  celles  habi 
confus.  Il  en  rAsulte  que 
'Aquipement  rencontre  des 
s de  populations  des  utili 
e cette  biomAtrie  et  cela 


sont  trAs  souvent  destines  A la  commer 
appelAs  A Stre  servis  dans  des  conditio 
tuellement  rencontrAes  dans  le  pays  dan 
, mSme  s'il  le  souhaite,  le  concepteur 
difficultAs  A tenir  compte  des  caractAr 
sateurs  potentials  et  surtout  des  aspec 
faute  d' information. 


ci 

ns 

s 

du 

is 

ts 


a- 

le- 


ti- 


Four  les  bureaux  d'Atudes,  les  laboratoires,  les  services  commerciaux,  les  dif- 
ficultes  A prendre  en  compte  ces  ensembles  de  paramAtres  biomAtr iques , lors  de  la  con- 
ception des  Aquipements,  peuvent  se  situer  A diffArents  niveaux  : 

- en  premier  lieu,  la  complexitA  des  problAmes  soulevAs  par  1 ' harmonisation  des 
impAratifs  biomAtriques  et  des  contraintes  techniques,  par ticul iArement  au 
niveau  de  la  conception  de  poste,  apparait  encore  mal  mattrisAeau  niveau  des 
bureaux  d'Atudes,  encore  que  cette  prAoccupation  s'y  dAveloppe  de  plus  en  plus, 

- en  second  lieu,  avant  la  crAation  d'une  Banque  de  DonnAes  de  BiomAtrie,  les 
bureaux  d'Atudes  ou  les  laboratoires,  n'accAdaient  que  dif f icilement  aux  in- 
formations biomAtriques  nAcessaires  A la  solution  des  problAmes  abordAs, 

- enfin,  mSme  en  possession  de  ces  informations,  I'ingAnieur  de  bureau  d'Atude 
ou  le  chercheur  de  laboratoire  observent  gAnAralement  une  inadAquation  entre 
les  donnAes  qu'ils  auront  recueillies  sous  forme  de  moyennes,  Acarts-types , 
percent ilages , relatives  A des  groupes  humains  particuliers  et  les  besoins 
spAcifiques  de  I'Atude  dont  ils  ont  la  charge. 


SOURCES  D' INFORMATIONS  DISPONIBLES  ET  INTERET  D'DNE  BASQUE  DE  DOHNEES  - 

On  ne  dispose  gAnAralement  que  de  deux  sources  d ' informations  : les  publications 
antArieures  et  les  Atudes  spAcifiques. 

2. I - Publications  scientif iques  ou  rapports  techniques  - 

Les  publications  antArieures,  scientif iques  ou  techniques  sont  relativement  ac- 
cessibles,  quoique  peu  de  services  de  documentation  possAdent  une  bibl iographie  con- 
venable  et  complAte  sur  le  sujet.  Un  des  principaux  inconvAnients  de  ce  type  de  docu- 
mentation provient  du  fait  que  les  Achantillons  de  populations  mesurAs  n'ont  souvent 
que  peu  de  rapport  avec  les  utilisateurs  potentials  d'un  matAriel  mis  A I'Atude.  GAnA- 
ralement les  donnAes  recueillies  pour  des  enquires  antArieures  1 ' ont  AtA  dans  un  but 
diffArent  et  leur  choix  ne  rApond  gAnAralement  pas  aux  besoins.  Il  deviant  done  nAces- 
saire  d'envisager  des  corrections  et  des  adaptations. 

Le  bureau  d'Atude  qui  travaille  A 1 ' Alaboration  d'un  vAhicule  de  transport  ou, 
de  faqon  beaucoup  plus  contraignante , sur  un  poste  de  pilotage  d'avion  a peu  de  chance 
de  trouver  les  dimensions  taille~aaeia,  longueur  feaae-genou,  dans  leaquelles  est  pris 
en  considAration  1 ' encombrement  de  1' Aquipement  individual.  On  connatt  beaucoup  mieux 
la  stature  d'un  homme  dAvitu  que  celle  du  mSme  homme  chaussA  et  complAtement  AquipA. 


La  dlBenaion  la  plua  intCreaaanta  1 connatcre  pour  dvaluar  1 ' laplantatlon  d'una 
coaaanda  ne  figura  dans  aucunc  <tuda  prCezisCante  at  I'on  an  ast  rdduit  1 faira  das 
Cvaluationa  hasardauaaa  lorsqu'allas  na  sont  pas  proposCas  par  daa  spCeialistas. 

Enfln,  las  rdsultats  sont  toujours  publics  sous  forne  da  paraaitras  statisci- 
quas  : aoyannas,  Ccar ta-typas , aaslaua  at  alnimua,  parfols  la  parcantilaga  ou  la  co- 
efflciant  da  variation  at  tria  raraaant  la  aatrica  d' intarcorrClations. 

A partir  da  cat  donnCas,  il  ast  axclu  actuallamant , pour  das  non  bioaCtricians, 
da  ta  livrer  au  moindra  calcul  nouvaau  sur  las  dimensions  sans  prendre  la  risque  da 
sa  livrer  i das  approxiaations  abusives.  II  suffit  A cat  Cgard  da  prendre  comma  exam- 
ple la  problCme  da  la  definition  d'un  aodCla  da  population  choisi  par  la  parcantilaga. 
Quelles  sont  las  valeurs  da  variabilitC  (parcantilaga  minimum  a^  maximum)  das  dimen- 
sions corporellet  tegmentairas  d'un  Ccbantillon  da  population  chbisi  A S Z pour  la  va- 
riable etatuT«  ? La  presentation  das  rCsultatt  laisse  parfois  croire  aux  utilisateurs 
qu'il  suffit  da  fairs  la  tomme  das  eiCmants  segmentairat  qui  composent  la  stature, 
choisit  individual lament  A S Z pour  retrouvar  la  population  stature  5 Z an  las  addi- 
tionnant.  On  an  arrive  ainsi  A la  situation  paradoxale  qui  conaitte  A aggraver  lee 
aontraintes  liSee  i la  Biomitrie  I 

2.2  - Etudes  anthropomCtriauas  snCcifiquas  - 


La  deuxiAme  source  da  donnCes  dont'  on  dispose  ast  constituCe  par  das  etudes 
epioifiquee  ou  enquStes  directes  adaptCes  au  problAme  posC  ou  A un  problAme  trAs  voi- 
sin.  Dans  ce  cat,  las  mesures  adCquatas  at  las  calculs  souhaitCs  sont  affectuCs  sur 
un  Ccbantillon  reprCsentatif  da  la  population.  La  coQt  d'una  telle  Ctude  peut  parfois 
parattre  relativement  ClevC  mais  bien  souvent  lorsqu'il  s'agit  d'un  material  relative- 
ment  sophistiquC  sa  realisation  s'avAre  parfaitement  justifies. 

2.3  - Evolution  des  caraetCristioues  biomCtriques  au  cours  du  temps  - 


Une  enquSte  ne  fournit  gCnCralement  des  donnCes  que  sur  la  morphologie  actualla 
d'una  population.  Une  telle  information  ne  satisfait  que  trAs  partiallement  la  plupart 
des  concepteurs  dont  le  material  A 1 'Ctude  ne  sera  mis  an  service  qua  cinq  ans  plus 
tard,  puis  utilise  pendant  une  pCriode  que  I'on  peut  Cvaluer  antra  quinxe  at  vingt-cinq 
ans.  On  doit  tenir  compte  dans  toute  la  mesure  du  possible  des  modifications  da  la  mor- 
phologie at  si  on  le  peut,  de  I 'incidence  des  Cquipemants  individuals  nouveaux  auscep- 
tiblas  d'Stre  mis  A la  disposition  des  utilisateurs. 

3 - LA  BANQOE  DE  DONNEES  DE  BIOMETRIE  HUMAINE  DANS  L'ABORD  DES  PROBLEMES  DE  CONCEPTION 
UIPEMENT  - 


3. 1 - Principe  de  conception  de  la  Banque  de  DonnCes  de  BiomCtrie  Humaine  - 

Les  difficultCs  que  rencontre  le  responsable  d'Ctude  et  de  conception  d'Cquipe- 
ment  sont  trAs  frCquemment  confiCes  aux  laboratoires  specialises,  et  tout  particuliA- 
rement  aux  laboratoires  d'Anthropologie. 

En  gCnCral,  lorsque  des  problAmes  ou  des  questions  sont  soumis  A ces  laboratoi- 
res, qui  assez  souvent  n'ont  dans  leurs  archives  qu'un  capital  insuffisant  d'informa- 
tion  pour  apporter  des  rCponses  complAtes,  ils  se  trouvent  confrontCs  A plusieurs  choix: 

- parfois  ils  rCpondent  qu'il  leur  est  impossible  de  fournir  les  renseignement s 
demandCs  dans  I'immCdiat,  car  1 ' exploitat ion  des  donnCes  existantes  correspon- 
dant  A cette  demands  n'a  pas  Ct€  effectuCe  dans  un  sens  que  les  questions  po- 
sCes  requiArent, 

- parfois,  ils  reprennent  les  donnCes  de  base  d'une  enquete  dont  les  fiches  de 
mesure,  les  cartes  perforCes  ou  les  bandes  magnCtiques  sont  stockCes  depuis 
plusieurs  annCes  en  archives  : une  telle  operation  entrains  trAs  souvent  un 
dClai  de  presentation  des  rCsultats  plus  ou  raoins  considerable  en  relation 
avec  I'activitC  gCnCrale  dudit  laboratoire, 

- parfois  enfin,  pour  reconstituer  le  modAle  de  population  envisage,  le  labora- 
toire interrogC  doit  consulter  les  donnCes  de  diffCrentes  enquetes  antCrieu- 
res  plus  ou  moins  CparpillCes.  II  se  rCvAle  plus  rapide  souvent  de  refaire 
I'enquete  sur  le  terrain,  ce  qui  prCsente  en  outre  I'avantage  d'orienter  le 
travail  sur  la  question  prCcisCment  posCe. 

Pour  pallier  les  difficultCs  qui  viennent  d'Stre  CvoquCes,  pour  rendre  plus  ai- 
sCe,  plus  rapide,  plus  adaptCe,  la  prise  en  compte  des  donnCes  anthropomCtr iques  dans 
la  conception  des  Cquipements,  le  Laboratoire  d'Anthropologie  et  d'Ecologie  Humaine  de 
I'UniversitC  RenC  Descartes  (Paris  V),  avec  le  soutien  de  la  D.R.H.E.,  a crCC  une  Ban- 
que de  Donnies  Internationale  de  Biomitrie  Humaine. 

Le  principe  d'une  telle  Banque  de  DonnCes  de  BiomCtrie  repose  sur  le  rassemble- 
ment  et  la  riutilieation  des  mesures  anthropomCtr iques  individuelles  recueillies  depuis 
des  dCcennies  sur  de  trAs  nombreuses  populations  du  monde. 


II  s'agit  done  de  rCunir  un  capital  compose  du  plus  grand  nombre  possible  de 
donnCes  disponibles  afin  de  les  mettre  A la  disposition  des  laboratoires,  des  bureaux 
d'Ctudes,  ou  des  chercheurs. 


Unc  •xploitacion  rationncll*  et  cfficaca  de  ce  fond  dc  donnCea,  d'origlne  tria 
diverac,  niceaaite  avanC  tout  une  definition  prdciae  dea  meaurea  effectudea  et  le  re- 
cueil  de  toutea  lea  caractSriatiquea  explicitea  ou  implicitea  dee  populationa  dtudidea. 

Lea  donneea  doivent  enauite  etre  enregiatrdea  aoua  une  forme  comprimde,  de  fa- 
(on  i rSduire  lea  tempa  d ' exploitation  et  lea  co&ta  de  traitement  dea  interrogations. 
Gr2ce  i cela  on  peut  enviaager  dea  dchanges  conversationnels  avec  la  Banque. 

Quelques  exemplea  vont  permettre  de  mettre  en  evidence  lea  types  de  problSmes 
ou  de  questions  auxquels  la  Banque  de  Donnees  peut  contribuer  B apporter  des  rdponses. 


3.2  - Examples  du  role  de  la  Banque  de  Donnees  dans  I'abord  de  probldmes  biometriques  - 


3.2.1  - Concept ion_de_goate_et_definition_biometri3ue_de_la_gogulation_utilisa- 
trlce  : 

Lots  de  la  conception  d'un  poste  de  pilotage,  par  example,  il  apparatt  indis- 
pensable, afin  de  satisfaire  le  plus  grand  nombre  d ' ut il isa teur s possible,  de  definir 
les  imperatifs  d' habitability  et  d'atteignabilitS  des  elements  de  poste  grSce  aux  me- 
sures  biometriques  les  plus  appropriees.  Il  conviendra  ensuite  de  connattre  leur  repar- 
tition statistique  et  de  choisir  i I'intdrieur  de  cette  repartition  les  bornes  minima- 
les et  maximales  pouvant  localiser  au  mieux  cette  population  d ' ut il isateurs . En  d'au- 
tres  termes  ces  limites  sont  retenuea  en  function  du  pourcentage  de  la  population  que 
I'on  desire  satisfaire. 

Il  existe  done  3 ce  niveau,  avant  toute  approche  exper imentale  un  ensemble  de 
questions  qui  se  posent  : 

- quelle  est  la  structure  gdnerale  de  la  population  utilisatrice  aotuelle  et  priviei- 
ble  selon  les  echeances  connues  de  mise  en  service  et  d ' ut il i sat  ion  du  materiel  7 
En  particulier  le  niveau  sociocultural  des  utilisateurs  risque-t-il  de  se  modifier 
dans  les  deux  decennies  suivantes  7 

- quelle  doit  Stre  la  constitution  precise  de  la  population  experlmentale  qui  partici- 
pera  en  laboratoire,  sur  maquette,  3 la  definition  des  zones  d'activite,  distance 
d'atteinte,  etc. ..7 

Dans  le  contexts  de  la  Banque  de  Donnees  de  Biometrie  telle  que  nous  I'avons 
conque  et  definie,  1 ' explo ita t ion  systematique  du  capital  de  donnees  individuelles 
ouvre  de  nouvelles  possibilites. 


I 

I 


11  est  devenu  possible  de  calculer,  individu  par  individu,  les  valeurs  les  plus 
probables  de  dimensions  non  mesurees  experimentalement . En  effet,  si  les  dimensions 
segmentaires  de  I'homme  sont  susceptibles  de  variations  considerables  entre  ethnies, 
groupes  prof essionnel s , classes  d'3ges,  etc...,  les  ecarts-types  et  les  correlations 
demeurent  au  contraire  d'une  grande  Constance. 

Avec  une  tres  bonne  approximation,  il  devient  possible  de  reconstituer  3 par- 
tir  d'un  petit  groupe,  bien  etudie  sur  le  plan  biometrlque,  le  profil  morphologique 
complet  d'une  population  representative  des  utilisateurs  d'un  materiel,  meme  si  3 
1 'origins  on  ne  possbde  que  quelques  mensurations  de  cette  population,  3 la  limits 
le  poids  et  la  taille  uniquement. 

Il  devient  egalement  possible  de  regrouper  des  enquetes  d'origlne  diverse  pour 
lesquelles  les  techniques  de  mesure  sont  comparables.  On  obtient  alors  des  £chantillons 
d'effectifs  plus  nombreux  ou  bien,  ce  qui  apparatt  tout  aussi  avantageux,  il  se  rdvele 
possible  de  reerSer,  3 partir  de  donndes  individuelles,  des  sous-£chantillons  homogS- 
nes  de  groupes  humains  particuliers  (selon  des  critSres  gdographiques , socioprof ession- 
nels)  dont  les  effectifs  deviennent  ggalement  suffisamment  importants  pour  etre  pris 
en  considSration  dans  les  £tudes  ergonomiques . 

On  conqoit  en  definitive  que  1 ' ut ilisation  de  ce  capital  de  donnSes  biomStriques, 
provenant  de  multiples  enquStes,  done  de  multiples  populations,  soit  extremement  fruc- 
tueuse.  Elle  permet  de  tenir  compte  avec  la  plus  grande  precision  de  la  variabilite 
des  populations  appeiees  3 se  servir  d 'equlpements  ou  de  materiels,  car  compte  tenu 
de  I'evolution  actuelle  et  des  iddes  en  ce  domaine,  la  conception  des  Cquipements  quels 
qu'ils  soient,  doit  integrer  beaucoup  plus  la  notion  biometrlque  et  biomecanique . 

En  effet,  3 quo!  servirait  un  materiel  ou  un  Aquipement  tr3s  sophistique,  done 
tr3s  coQteux,  si  1 ' ut il isateur , place  en  situation  operationnelle,  comportant  frequem- 
ment  une  charge  physique  ou  perceptive  tr3s  eievee,  ne  pouvait  obtenir  le  maximum  de 
performance  7 

En  outre,  le  temps  de  reponse  aux  questions  posees  se  revBle  trie  bref  pour  les 
questions  simples  et  sens  commune  mesure  avec  la  r£alisation  d'une  £tude  pour  les  pro- 
blSmes  plus  complexes. 

3.2.2  - Yues_jrosBec t ives_ i^_Pre vis ion_d^une_£volut ion_morghologiaue 

L ' intCgrat ion  dans  la  Banque  de  Donn£es  d'enquStes  successives  CtalCes  dans  le 
temps,  mais  rCalisCes  sur  une  mSme  population  autorise  des  extrapolations  sans  risque 
et  des  privisions  concernant  I'Cvolution  morphologique  de  cette  population.  Ceci  appa- 


•> 


ratt  d'autant  plua  Important  qua  I'on  a'attache  i I'Stude 
qua  de  5 1 10  ana  avant  leur  mlae  an  plaea  at  damauraront 

ana. 


d ' Cqulpamanta  qui  sont 
an  aarvica  pandant  IS 


con- 
i 20 


On  aait  combian  la  morphologla  (aouvant  raprCaantda  dana  la  littCrature  par  la  j | 

aaula  atature,  alors  qua  laa  dinenaiona  tranavaraalaa  variant  Cgalemant)  Svolue  dans  j j 

la  plupart  das  populations.  II  faut  ramarqusr  i cat  £gard,  qua  1 ' accroissemant  da  la  j 

tailla  est  li£  d deux  phSnominas,  una  augmantation  rSalla  da  la  taille  da  certaines  | 

catCgorias  (phSnomSnes  d'action  aur  la  dSvaloppamant  at  la  croissanca)  mais  aussi  una  - j 

diminution  du  nombre  da  sujets  da  petite  taille.  On  donna  i cette  (volution  una  in- 
tarprdtation  essentiellement  biologlque  assocKe  (troitement  i das  phdnomines  soclaux 
at  culturela. 

Gr2ce  aux  donndea  morphologiquea , il  deviant  possible  d'obtenir  una  estimation 
relativement  precise  (auf f Isamment  precise  pour  la  finalitd  recherch(e)  da  1 '(volu- 
tion pr(viaible  au  coura  du  tamps  non  saulement  da  la  taille  mais  encore  das  autres 
dimensions  segmentaires.  Cet.ta  estimation  ne  pent  blen  entendu  se  faire  qu'en  fonction 
das  donn(es  toclod(mographlques  at  socioculturelles  relatives  A la  population  (tudi(e. 

On  conqoit  d(a  lore  toute  l'efficacit(  da  I'outil  qua  repr(sente  la  Banque  da 
Oonn(es  da  Biom(trie  antra  las  mains  d'(quipes  comp(tentes  da  laboratoires  ou  da  bu- 
reaux d'(tudes.  II  deviant  possible  da  r(aliser  las  meilleurs  compromis  au  niveau  das 
intarfaaaa  Bomme-Equipement,  sachant  qu'en  mettant  ( l'(tude  un  projet  da  poste  da 
pilotage,  un  v(hicule  da  transport  an  common,  una  machine  outil  ou  un  projet  d'archi- 
tecture,  on  pent  d'embKe  int(grer  las  paramStres  biom(triques  das  future  utilisataurs, 

2 das  (ch(ances  d'au  molns  IS  ans,  avec  un  risque  d'erreurs  pratiquement  nul. 

En  d(finitive,  il  apparatt  clairement  qua  la  liste  das  besoins  at  das  utilisa- 
teurs  potentials  d(passe  largement  la  contexte  das  quelques  examples  qui  viennent 
d'2tre  (voqu(s.  Qu'il  s'agisse  das  domaines  da  l'(quipement  individual,  vetements, 
gants,  masques  da  protection,  das  (quipements  collectifs,  transports,  machines,  v(hi- 
cules  terrestres  ou  a(riens,  da  1 ' architecture  da  I'urbanlsme  avec  las  multiples  as- 
pects qua  recouvrent  ces  domaines,  la  biom(tria  intervient  g(n(ralement  en  amont  da 
touts  r(alisation,  au  niveau  da  la  conception  ; sa  prise  en  compte  acquiert  d'autant 
plus  da  valeur,  qua  las  (qulpemants  conqus  soot  plus  sophistiqu(s,  plus  coSteux  at 
doivant  2tra  servis  dans  ces  (ventualit(s  par  das  personnels  trSs  strictement  sdec- 
tionn(s. 
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4 - DKSCKIPTION  SCHBMATIQOE  DE  LA  STROCTORE  IHFORMATIQOE  DE  LA  BANQOE  - 

La  r(alisation  du  logioial  a (t(  guid(e  par  deux  principes  da  base  : 

- La  riduotion  au  minimum  indiapanaabla  du  volume  d' informations  transf(r(es 
au  coura  d'une  interrogation. 

- La  adparation  trda  nette  du  prooeaaua  d' extraction  das  processus  da  traite- 
ment  atatlstique  das  donn(es  at  la  d(veloppement  d'une  gamma  (tendue  da  mo- 
dalit(s  d'extraction. 

4.1  - R(duction  du  volume  d* inf ormationa  transf(r(es  - 

La  r(duction  au  minimum  da  volume  d' informations  transf(r(es  a (t(  obtenue  gri- 
ce  aux  dispositions  suivantea  ; 

- chacun  das  lots  da  valeurs  corraspondant  aux  diverses  rubriques  d'une  enquSte  est 
rendu  accessible  directament, 

- un  algorithme  simple  da  compression  da  donn(es  permet  da  r(duire  au  minimum  la  pla- 
ce occup(e  par  las  lots  da  valeurs. 

4.2  - La  s(paration  dea  processus  d'extraction  - 

Du  point  da  vue  du  logiciel,  una  Banque  da  Donn(es  peut  se  pr(santer  comma  un 
mode  d'axtraotion  gSndraliaS  das  informations.  11  s'agit  done  d'un  systime  qui  permet 
la  traitement  efficace  da  pratiquement  toutea  las  questions  possibles  dans  la  domaine 
da  la  Biom(trie  at  da  la  Biom(canique. 

La  s(paration  du  processus  d'extraction  du  processus  da  traitement  est  la  pre- 
mi2re  condition  d'une  telle  efficacit(.  L'adjonction  d'un  traitement  nouveau  se  limite 
ici  2 dea  calculs  nouvaaux  aur  das  donn(es  axtraltes.  L'inventaire  dea  poaaibilitia 
d'extraction  est  la  deuxKme  condition  d' ef f icacit(.  S'il  n'est  pas  possible  da  pr(- 
voir  d'embKe,  toutea  las  modalit(s  d'extraction,  il  faut  au  moins  en  pr(voir  un  grand 
nombre  at  faire  an  aorta  qua  la  systSme  d'extraction  soit  ouvert,  c'est-2-dire  qua 
l'adjonction  d'une  nouvelle  poaslbilitd  d'extraction  soit  ais(e. 

4.3  - Organisation  das  programmes  - 

La  logiciel  comprand  deux  groupas  da  programmes . (Cf . sch(mas  suivants). 

4.3.1  - Le_grogramms_d^intarrogat ion  : 

Ca  programme  assure  trois  fonctions  principales  : 


f 


- il  assure  I'aide  d 1 ' interrogat ion  dans  la  formulation  des  questions, 

- il  permet  toutes  les  modalitSs  envisageables  d'extraction  des  donnSes  et 
Sventuel lement  1 ' association  i un  programme  de  traitement. 


- il  permet  enfin,  d'introduire  3 tout  moment  des  possibilitSs  nou'elles  de 
calcul  sur  les  donn^es  extraites. 


4.3.2  - Le_grogramme_de_mise_4_iour  : 

Ce  programme  a pour  but  de  permettre  1 ' introduction  3 tout  moment  des  donndes 
d'origine  diverse,  dans  la  structure  commune  du  logiciel  de  la  Banque. 

Avant  d'introduire  les  donnSes  individuelles  d'une  enquete  dans  la  Banque,  il 
est  ndcessaire  de  ddcrire  au  prealable  sea  divers  constituants,  sur  cartes.  On  distin- 
gue 3 cet  Sgard  des  oonstituanta  gSnSraux  (independent  d'une  enquete  donnde)  et  des 
oonatituanta  apioifiquea , 

En  definitive,  le  logiciel  de  la  Banque  de  Donndes  prSsente  un  double  avantage  : 

- sur  le  plan  de  la  programmation,  un  tel  logiciel  permet,  avec  un  maximum 
d'adaptation  de  faire  face  3 tous  les  types  de  traitements  susceptibles  de 
se  presenter, 

- sur  le  plan  des  donndes,  1 'accumulation  des  informations  comparables  ouvre 
des  possibilicds  nouvelles  de  traitement. 


5 - CONCLUSIONS  ET  PERSPECTIVES  DE  DE VELOPPEMEMT  - 

5.1  - Originalite  de  la  Banque  de  BiomStrie  Humaine  - 

Il  n'apparatt  pas  utile  de  ddvelopper  plus  en  detail  les  possibilitSs  d'inter- 
vention  de  la  Banque  de  Donnees  de  BiomStrie  dans  tous  les  domaines  de  conception 
d'dquipement  et  d'adaptation  de  1 ' environnement  aux  besoins  des  ut  il isateurs . Sa  crea- 
tion, d'ailleurs,  est  nee  de  I'importance  de  la  demande  en  ce  domaine  et  de  I'impossi- 
bilite  dans  laquelle  notre  Laboratoire  et  tres  certainement  de  nombreux  autres  se 
trouvaient  d'y  repondre  faute  de  disposer  d ' un  instrument  de  ce  type. 

La  Banque  de  Donnees  telle  qu'elle  est  conque  apparatt  comme  un  outil  d'utilisa- 
tion  tres  souple  et  complet,  capable  de  traiter  au  moindre  ooOt  de  nombreux  problemes 
d'ergonomie  dans  lesquels  interviennent  les  aspects  de  dimensions  et  de  variability 
morphol ogiques . Il  ne  peut  y avoir  d'ergonomie  de  conception  sans  biometrie. 

Alors  que  traditionnellement  chaque  probleme  pose  se  traduit  par  I'etude  anthro- 
pomytrique  spycifique  d'un  ychantillon  plus  ou  moins  reprysentatif  d'une  population 
donnde,  la  Banque  de  Donn§es  permet,  dans  la  plupart  des  cas,  soit  d'eviter  de  nouvel- 
les enquetes,  soit  de  considyrablement  rdduire  I'ampleur  de  I'enquete  nycessaire.  En 
prenant  les  prycautions  necessaires  il  est  possible  de  constituer  des  ychantillons  re- 
prysentatifs  3 partir  d'yiements  sous-reprysentys  pris  dans  plusieurs  enquetes,  de 
transposer  des  rdsultats  d'une  enquete  biomytrique  3 une  autre  avec  un  risque  d'erreur 
insignifiant  eu  dgard  au  besoin  industrial. 

Sur  un  plan  inf ormat ique , la  Banque  de  Donnees  de  Biomytrie  Humaine  se  distingue 
3 la  fois  des  programmes  statistiques  gyndraux  et  des  Banques  de  Donnyes  classiques. 

Elle  possede  les  memes  capacitys  que  les  programmes  statistiques  gdnyraux  du 
point  de  vue  des  calculs  et  de  la  validation  des  donnyes.  Elle  permet  en  outre  de  met- 
tre  3 jour  des  ensembles  de  fichiers,  de  sdlectionner  des  sous-ensembl es  3 travers  plu- 
sieurs enquetes. 

Par  rapport  aux  Banques  de  Donnyes  classiques,  elle  se  distingue  par  la  simplici- 
ty d ' organisation.  Le  fait  que  les  donndes  individuelles  soient  introduites  par  enque- 
te permet  d'eviter  la  crdation  de  gpoupee  et  par  consdquent  la  hidrarchisation  des 
enregistrements  en  function  de  la  frdquence  supposde  des  demandes.  Lots  d'une  interro- 
gation de  la  Banque,  la  donnde  biomdtrique  recherchde  est  extraite  par  exploration  de 
I'ensemble  des  fichiers.  Le  temps  de  rdponse,  compte  tenu  des  programmes  mis  en  place, 
est  de  I'ordre  de  la  seconde. 

5.2  - Voies  de  ddyeloppement  envisagdes  - 


Les  voies  de  ddveloppement  suivantes  sont  actuellement  envisagdes  : 

- extension  du  fond  de  donndes  anthropomd triques  3 I'aide  de  nouvelles  enquStes  four- 
nies  par  tous  les  laboratoires  ou  possesseurs  de  donndes  biomdtriques  ddsirant  s'as- 
socier  aux  utilisateurs  de  la  Banque.  A ce  jour  les  informations  en  cours  d'introduc- 
tion  dans  la  Banque  reprdsentent  avec  310  mesures  anthropomdtr iques  diffdrentea,  un 
capital  de  1.800.000  donndes  individuelles  provenant  de  diffdrents  laboratoires  eu- 
ropdens  et  amdricains  concernant  des  groupes  humains  nombreux  et  varids  d'Europe, 
d'Afrique  et  d'Amdrique, 


- gdndralisation  3 d'autres  domaines  fondds  sur  le  ddveloppement  d'enquetcs  rdgulidres 
intdressant  les  domaines  de  I'ergonomie  ; rdactions  aux  vibrations,  variabilitd  des 
efforts  ddveloppablea  par  des  opdrateurs  au  poste  de  travail,  etc... 

! 
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- dSveloppement  de  la  programmation  directe  par  lea  util laateura . D'oret  et  ddji  un 
certain  nombre  de  calculs  simples  peuvent  Stre  demandds  directemenC  sur  le  mode 
conversationnel . En  compliquant  les  posaibllitds  de  calcul  et  en  introduiaant  lea 
conditions  logiques,  un  utilisateur  doit  pouvoir  traiter  directement,  sans  grand 
apprent isaage , des  probldmes  relativement  complexes, 

- la  liaison  avec  une  console  de  visualisation  eat  actuellement  envisagSe.  En  effet 
il  est  apparu  au  cours  d ' expdr iences  de  simulation  que  de  nombreux  probldmes  de 
biomdtrie  et  d'ergonomie  - conception  d'engins  et  de  poates  de  conduite  ou  de  com- 
mandes  morphotypes  de  structures  anatomiques  (mains,  face,  pied,  etc...-  pouvaient 
Stre  resolus  avec  un  gain  de  temps  considdrable  par  une  simulation  avec  visualisa- 
tion. 

Dans  certains  problSmes,  le  jeu  de  questions  et  de  rdponses  immddiates  en  mode 
conversationnel  facilite  i..  progression  et  permet,  par  essais  successifa,  d’aboutir  3 
une  solution  satisfaisante. 


De  meme,  la  possibility  de  moddliser  sur  dcran  les  diffSrentes  £ventualitCs  de 
choix  qui  se  prdsentent  gSnyralement , dans  toutes  Studes  de  conception  d ' Squipement , 
en  raison  des  compromis  permanents  recherchda  entre  les  contraintes  techniques  et  les 
aspects  de  la  variability  biologique,  apparalt  dSs  i prysent  u;se  voie  fructueuse  qu'il 
convient  d'exploiter. 
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1.  GENERAL:  BIOMETRIC  DATA  AND  EQUIPMENT  DESIGN 

In  all  spheres  of  activity  the  design  and  development  of  systems  and  equipment  of  ever  increasing  complexity 
make  it  essential  to  give  consideration  at  the  very  earliest  design  stages  to  the  ergonomic  layout  of  such  equipment. 
This  must  take  into  account  the  characteristics  of  the  populations  of  users  who  will  have  to  operate  the  equipment 
or  use  its  services  at  known  dates  in  the  fairly  distant  future. 

In  addition,  and  this  is  a general  trend,  all  items  of  equipment,  and  particularly  certain  types  of  vehicle  driving 
compartments  and  aircraft  cockpits  are  becoming  increasingly  more  complex.  Quite  often  they  have  to  be  operated 
in  workload  conditions  which  are  such  that,  if  the  man-macliine  relation  studies  have  not  been  taken  into  sufficient 
account,  or  have  been  incorrectly  taken  into  account  at  the  design  stage,  the  user  will  never  obtain  maximum 
efficiency  from  the  equipment  at  the  critical  moment.  The  resulting  consequences  will  of  course  go  far  beyond  the 
mere  co-^icept  of  use,  since  they  may  affect  the  safety  of  a man  or  of  a crew  and  compromise  the  success  of  a mission. 

Furthermore,  the  same  equipment  is  very  often  intended  for  marketing  and  export  and  is  thus  required  to  be 
operated  in  conditions  which  are  frequently  not  the  same  as  those  normally  found  in  the  country  in  which  it  was 
designed.  But,  even  if  the  equipment  or  system  designer  wanted  to  take  account  of  the  biometric  characteristics  of 
populations  of  potential  users,  and  particularly  of  the  variability  aspects  of  such  biometry,  he  is  finding  it  difficult  to 
do  so,  simply  because  the  relevant  facts  are  not  available. 

In  the  case  of  design  offices,  laboratories  and  commercial  departments  of  firms,  the  difficulty  of  taking  such 
sets  of  biometric  parameters  into  account  at  the  design  stage  of  the  equipment  arises  at  various  levels; 

- firstly,  the  complexity  of  the  problems  involved  in  harmonising  biometric  requirements  and  technical  con- 
straints, particularly  at  the  design  level,  still  appears  to  have  been  inadequately  overcome  by  design  offices, 
even  though  this  is  a matter  which  is  of  increasing  concern; 

- secondly,  before  a Biometry  Data  Bank  was  set  up,  design  offices  and  laboratories  had  great  difficulty  in 
gaining  access  to  the  biometric  data  required  for  solving  problems  under  investigation; 

- finally,  even  when  he  obtains  these  data,  the  design  engineer  or  the  laboratory  researcher  generally  finds  a 
great  gap  between  the  data  given  to  him  in  the  form  of  averages,  standard  deviations  or  percentiles  relating 
to  particular  human  groups,  and  the  specific  requirements  of  the  design  for  which  he  is  responsible. 

AVAILABLE  SOURCES  OF  INFORMATION  AND  THE  ADVANTAGES  OF  A DATA  BANK 
Only  two  sources  of  information  are  generally  available:  previous  publications  and  specific  studies. 

Scientific  Publications  or  Technical  Reports 

Previous  scientific  or  technical  publications  are  relatively  accessible,  although  few  documentation  services  have 
a suitable  and  comprehensive  bibliography  on  this  subject.  One  of  the  main  disadvantages  of  this  type  of  docu- 
mentation arises  from  the  fact  that  the  population  samples  measured  often  bear  very  little  relationship  to  potential 
users  of  an  item  of  equipment  currently  under  design.  The  data  collected  for  previous  surveys  have  generally  been 
for  a different  purpose  and  their  selection  does  not  normally  fit  specific  requirements,  thus  leading  to  the  need  for 
corrections  and  amendments. 

The  design  office  working  on  the  design  of  a public  transport  vehicle  or,  something  which  imposes  many  more 
constraints,  the  cockpit  of  an  aircraft,  has  very  little  chance  of  finding  the  dimensions  relating  to  sitting  height  or 
buttock  to  knee  length  within  which  the  overall  size  of  the  equipment  has  to  be  designed.  Much  more  information 
is  available  on  the  height  of  a man  without  his  clothes  than  on  the  height  of  the  same  man  with  his  shoes  on  and  ' 
wearing  all  his  equipment. 

The  most  useful  measurement  to  know  when  assessing  the  location  of  a control  never  appears  in  any  existing 
study  and  one  is  reduced  to  making  assessments  which  may  be  dangerous  if  they  have  not  been  made  by  specialists. 
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Finally,  the  results  are  always  published  in  the  form  of  statistical  parameters:  averages,  standard  deviation. 


maximum  and  minimum  values,  occasionally  percentiles  or  the  coefficient  of  variation,  and  very  rarely,  the  inter- 
conelation  matrix. 

At  the  present  time  non  biometricians  are  excluded  from  making  the  least  new  calculation  about  dimensions 
using  the  above  data,  for  fear  of  the  risk  of  arriving  at  a wrong  approximation.  An  obvious  example  of  this  is  the 
problem  of  defining  a sample  population  selected  on  the  basis  of  percentile  values.  What  are  the  variability  values 
(minimum  and  maximum  percentiles)  of  the  segmental  physical  measurements  of  a 5%  population  sample  in  the  case 
of  the  ‘height’  variable?  The  results  are  often  presented  in  a form  which  leads  users  of  the  data  to  believe  that  all 
that  is  necessary  is  to  add  together  the  various  segmental  data  constituting  ‘height’ .selected  on  a S%  basis,  in  order  ' 
to  find  the  height  of  5%  of  the  population.  We  thus  arrive  at  the  paradoxical  situation  of  aggravating  the  constraints 
associated  with  biometry! 

2.2  Specific  Anthropometric  Studies 

The  second  available  source  of  information  consists  in  specific  studies  or  direct  surveys  catering  for  the  particular 
problem  involved  or  one  very  much  akin  to  it.  In  such  case,  suitable  measurements  and  the  desired  calculations  are 
carried  out  on  a representative  sample  of  the  population.  The  cost  of  such  a survey  may  sometimes  seem  relatively 
high  but  very  often  proves  fully  justified  if  the  equipment  concerned  is  relatively  sophisticated. 

2.3  Changes  in  Biometric  Characteristics  over  a Period  of  Time 

A survey  will  generally  provide  data  only  on  the  current  morphology  of  a population.  But  such  information  can 
only  partially  meet  the  requirements  of  the  majority  of  designers  who  are  working  on  equipment  which  will  not  come 
into  service  until  five  years  later,  and  then  be  used  for  a period  estimated  at  between  fifteen  and  twenty-five  years. 
Maximum  account  should  be  taken  of  changes  in  morphology,  together  with,  if  possible  the  number  of  new  individual 
equipments  likely  to  be  available  to  users. 


3.  THE  HUMAN  BIOMETRY  DATA  BANK  AND  EQUIPMENT  DESIGN  PROBLEMS 

3.1  Basic  Principles  of  the  Design  of  the  Human  Biometry  Data  Bank 

Any  difficulties  encountered  by  equipment  designers  are  very  often  referred  to  specialist  laboratories,  and 
particularly  to  anthropology  laboratories. 

Generally  speaking,  because  the  stock  of  data  held  in  their  files  is  very  often  insufficient  to  enable  them  to  give 
complete  answers  to  the  problems  or  questions  put  to  them,  these  laboratories  rind  themselves  having  to  make  one  of 
several  choices  when  replying: 

— sometimes  they  reply  that  they  cannot  immediately  supply  the  information  requested  as  the  existing  data 
relating  to  the  request  have  not  been  analysed  from  the  particular  aspect  required  by  the  questions  asked; 

— sometimes  they  go  back  to  the  basic  data  obtained  during  a survey,  the  relevant  forms,  punched  cards  or 
magnetic  tapes  for  which  have  been  filed  away  for  several  years:  this  operation  very  often  means  that  the 
results  may  take  some  time  to  produce,  depending  on  the  general  workload  of  the  laboratory; 

— finally,  on  some  occasions,  in  order  to  reconstitute  the  sample  population  being  considered,  the  laboratory  to 
which  the  matter  has  been  referred  has  to  consult  data  obtained  from  previous  more  or  less  scattered  surveys. 
I*  often  turns  out  to  be  quicker  to  conduct  the  survey  again  on  the  spot,  and  this  has  the  added  advantage 
that  the  investigation  can  be  directed  to  the  specific  question  raised. 

To  overcome  the  difficulties  just  mentioned,  and  to  make  it  easier  and  quicker  to  take  account  of  anthropo- 
metric data  when  designing  equipment,  and  to  do  so  more  appropriately,  the  Anthropology  and  Human  Ecology 
Laboratory  of  the  University  Ren4  Descartes  (Paris,  V),  with  the  support  of  the  Direction  des  Recherches  et  Moyens 
d’Essais  (Directorate  of  Research  and  Test  Facilities)  has  set  up  an  International  Human  Biometry  Data  Bank. 

The  basis  of  a data  bank  of  this  kind  is  the  collection  and  re-use  of  individual  anthropometric  measurements 
gathered  over  several  decades  on  a large  number  of  world  populations. 

It  is  therefore  a matter  of  assembling  a stock  of  the  largest  possible  number  of  available  data  items  and  placing  them 
at  the  disposal  of  laboratories,  design  offices  and  researchers. 

A primary  requirement  for  rational  and  efficient  application  of  this  data  base,  created  from  various  sources,  is 
a precise  statement  of  the  measurements  made,  and  the  collecting  together  of  all  the  explicit  or  implicit  character- 
istics of  the  populations  studied. 

The  data  must  then  be  recorded  in  a compressed  form  so  as  to  reduce  the  time  required  for  their  application 
and  the  cost  of  processing  interrogations.  Conversational  exchanges  with  the  bank  can  then  be  envisaged. 
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A few  examples  will  illustrate  the  types  of  problems  or  questions  for  which  the  r'  a bank  can  help  to  provide 
answers. 

3.2  Examples  of  the  Part  pbyed  by  the  Data  Bank  in  tackling  Biometric  Problems 

3.2.1  Design  of  an  Item  of  Equipment  and  Biometric  Definition  of  the  User  Population 

When  a cockpit  for  example  is  being  designed,  it  would  seem  essential,  in  order  to  satisfy  the  largest  possible 
number  of  users,  to  define  the  ‘habitability*  (or  environmental)  requirements,  and  the  ‘reachability’  requirements  for 
the  various  components  in  the  cockpit,  using  the  most  appropriate  biometric  measurements.  It  is  then  helpful  to 
know  the  statistical  distribution  of  these  measurements  and  to  choose,  within  this  distribution,  the  minimum  and 
maximum  limits  within  which  to  place  this  population  of  users.  In  other  words,  these  limits  are  adopted  in  relation 
to  the  percentage  of  the  population  whose  requirements  are  to  be  met. 

A number  of  questions  therefore  arise  at  this  level  before  any  experimental  approach  can  be  made: 

- What  is  the  general  structure  of  the  present  and  foreseeable  user  population  according  to  the  known  dates  of 
entry  into  service  of  the  equipment  concerned?  In  particular,  is  the  social  and  cultural  level  of  the  users  likely 
to  alter  during  the  next  two  decades? 

- What  is  to  be  the  exact  constitution  of  the  experimental  population  which  will  take  part,  in  the  laboratory  or 
in  a mock-up,  in  defining  the  areas  of  activity,  reaching  distance  etc? 

In  the  context  of  the  Biometry  Data  Bank  as  designed  and  defined  by  us  the  systematic  use  of  the  stock  of 
individual  data  items  held  in  store  opens  up  new  possibilities. 

It  is  now  possible  to  calculate  for  each  individual  person  the  most  probable  values  of  the  dimensions  which  have 
not  been  measured  experimentally.  In  fact,  although  the  segmental  measurements  of  a person  are  likely  to  vary  con- 
siderably between  ethnic  groups,  professional  groups,  age  classes  etc,  standard  deviation  values  and  correlations  remain 
on  the  other  hand  largely  constant. 

A small  group  which  has  been  subjected  to  a careful  biometric  study  can  help  to  reconstitute  the  complete 
morphological  profile  of  a representative  population  of  users  of  a particular  equipment,  even  though  initially  only  a 
few  measurements  in  regard  to  this  population  were  available,  at  most,  weight  and  height  only. 

It  is  also  possible  to  regroup  surveys  undertaken  for  various  reasons,  but  for  which  the  measuring  techniques  are 
comparable.  This  will  result  in  larger  samples  or,  and  this  would  appear  to  have  just  as  many  advantages,  individual 
data  items  can  be  used  to  recreate  homogeneous  sub-samples  of  particular  human  groups  (based  on  geographical,  social 
and  professional  criteria),  the  members  of  which  are  also  sufficiently  numerous  to  be  taken  into  consideration  for  the 
purpose  of  ergonomic  studies. 

It  will  be  realised,  finally,  that  the  use  of  this  capital  stock  of  biometric  data  gathered  during  a very  large  number 
of  surveys,  and  therefore  on  a very  large  number  of  populations,  can  provide  extremely  fruitful  results.  It  enables 
highly  accurate  account  to  be  taken  of  the  variability  of  the  populations  required  to  use  systems  and  equipment, 
since  in  view  of  present  developments  and  ideas  in  this  field,  the  design  of  equipment  of  whatever  kind  is  bound  to 
incorporate  to  a much  greater  extent  the  concept  of  biometry  and  biomechanics. 

Indeed,  what  would  be  the  good  of  a very  sophisticated,  and  therefore  very  costly  item  of  equipment  if  the  user, 
finding  himself  in  operational  conditions  frequently  involving  great  physical  and  mental  responsibility,  were  unable  to 
derive  maximum  efficiency  from  it? 

Furthermore,  the  time  taken  to  reply  to  questions  is  very  short  in  the  case  of  simple  questions  and  without  rny 
common  measure  with  the  performance  of  a study  for  solving  more  complex  problems. 

3.2.2  Prospection:  Predicting  Morphological  Changes 

As  a result  of  incorporation  in  the  Data  Bank  of  the  results  of  succesive  surveys  conducted  over  a period  of 
time,  but  using  the  same  population,  it  is  possible  to  extrapolate  without  risk  of  error  and  to  predict  the  morpho- 
logical development  of  this  population.  This  would  appear  to  be  especially  important  as  we  are  concerned  with  work 
on  equipment  which  is  being  designed  five  to  ten  years  before  it  is  due  to  come  into  service  and  which  then  remains 
in  service  for  some  fifteen  to  twenty  years. 

The  extent  to  which  morphological  changes  (often  represented  in  the  literature  merely  by  height,  whereas  the 
transverse  dimensions  also  vary)  occur  in  the  majority  of  populations  is  well  known.  It  should  be  noted  in  this 
connection  that  increased  height  is  associated  with  two  phenomena:  an  actual  increase  in  the  height  of  certain 
categories  (phenomena  affecting  development  and  growth)  and  also  a decrease  in  the  number  of  subjects  who  are  small 
in  stature.  These  changes  are  regarded  mainly  as  biological  changes  closely  dependent  on  social  and  cultural 
phenomena. 
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Using  morphological  data  it  is  possible  to  make  a relatively  precise  estimate  (precise  enough  for  the  final  result 
desired)  of  the  foreseeable  development  with  time  not  only  of  stature  but  also  of  the  other  segmental  dimensions. 

This  estimate  can  of  course  be  made  only  on  the  basis  of  the  sociodemographic  and  sociocultural  data  relating  to  the 
population  studied. 

The  effectiveness  of  such  a tool  as  the  Biometry  Data  Bank  in  the  hands  of  competent  laboratory  or  design  office 
teams  can  therefore  well  be  imagined.  It  allows  them  to  achieve  the  best  compromises  for  the  man-machine  inter- 
faces, knowing  that  when  designing  a cockpit  or  a public  transport  vehicle,  a machine  tool  or  an  architectural  project, 
they  can  incorporate  immediately  the  biometric  parameters  of  those  who  will  be  using  the  equipment  at  dates  at  least 
fifteen  years  hence,  and  with  practically  no  risk  of  error. 

Finally,  it  is  clear  that  the  list  of  requirements  and  potential  users  is  largely  outside  the  context  of  the  few 
examples  just  quoted.  Whether  we  are  concerned  with  equipment  for  the  individual:  clothing,  gloves,  protective 
masks,  or  equipment  for  collective  use:  means  of  transport,  machines,  land  vehicles  or  aircraft,  architectural  projects 
for  town  planning,  with  the  very  many  aspects  of  all  these  various  fields,  biometry  is  generally  involved  before  the 
final  development  stage,  e.g.  at  the  design  stage;  taking  biometry  into  account  is  all  the  more  important  as  the  equip- 
ment designed  becomes  more  sophisticated  and  more  expensive  and  for  these  reasons  has  to  be  operated  by  very 
carefully  selected  personnel. 


4.  DIAGRAMMATIC  DESCRIPTION  OF  THE  DATA  PROCESSING  STRUCTURE  OF  THE  BANK 


The  creation  of  the  software  has  been  based  on  two  guiding  principles: 

— reducing  to  the  absolute  minimum  the  volume  of  data  transferred  during  an  interrogation; 

— separating  the  extraction  process  from  the  statistical  processing  operations  of  the  data  and  developing  a 


comprehensive  range  of  extraction  facilities. 


4.1  Reducing  the  Volume  of  Data  Transferred 

The  volume  of  data  transferred  has  been  reduced  to  a minimum  by  means  of  the  following: 

— each  of  the  sets  of  values  corresponding  to  the  various  headings  of  a survey  can  be  accessed  direct; 

- a simple  data  compression  algorithm,  makes  it  possible  to  reduce  to  a minimum  the  space  occupied  by  the  sets 
of  values. 


4.2  Separating  the  Extraction  Processes 


From  the  software  point  of  view  a data  bank  can  be  a generalised  means  of  extracting  data.  The  system  con- 
cerned is  therefore  one  which  processes  efficiently  practically  all  the  questions  which  could  possibly  arise  in  the  field 
of  biometry  and  mechanics. 


Separating  the  extraction  process  from  the  processing  operations  is  the  first  pre-requisite  for  this  kind  of 
efficiency.  The  addition  of  further  processing  is  limited  here  to  further  calculations  based  on  the  data  extracted. 

An  inventory  of  the  extraction  facilities  is  the  second  requirement  for  efficiency.  If  it  is  not  possible  to  provide  from 
the  start  for  the  inclusion  of  all  the  extraction  facilities,  provision  for  at  least  a large  number  should  be  made  and  the 
extraction  system  designed  so  that  it  is  open-ended,  thus  enabling  a new  facility  to  be  added  without  difficulty. 


4.3  Program  Organisation 

The  software  comprises  two  types  of  programs  (see  accompanying  diagrams). 

4.3.1  T!ie  Interrogation  Program 

This  program  performs  the  following  main  functions: 

— it  assists,  during  the  interrogation,  with  formulation  of  the  questions; 

- it  permits  the  functioning  of  all  the  data  extraction  facilities  which  can  be  envisaged  and,  if  necessary,  their 


association  with  a processing  program; 


— finally,  it  makes  it  possible  to  introduce  at  any  time  further  possibilities  for  calculations  based  on  the  data 
extracted. 


4.3.2  The  Update  Program 


This  program  enables  data  from  various  sources  to  be  input  at  any  time  into  the  common  structure  of  the 
Bank’s  software. 


Before  the  input  of  the  individual  data  items  of  a survey  into  the  Bank,  the  various  constituents  of  the  survey 
must  be  described  beforehand  on  cards.  Here  a distinction  is  made  between  general  constituents  (not  related  to  a 
given  survey)  and  specific  constituents. 

Finally,  the  Data  Bank  software  has  the  following  dual  advantage: 

- from  the  programming  aspect,  this  type  of  software  makes  it  possible,  with  maximum  adaptation  facilities,  to 
cater  for  all  the  types  of  processing  which  may  arise; 

— from  the  data  point  of  view,  the  accumulation  of  comparable  data  opens  up  further  processing  possibilities. 


STRUCTURE  OF  THE  BIOMETRY  DATA  BANK 
I.  Input  of  Data  Files 
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II.  Interrogation  of  the  Bank 


5.  CONCLUSIONS  AND  DEVELOPMENT  PROSPECTS 
S.l  Originality  of  the  Human  Biometry  Data  Bank 

It  would  appear  unnecessary  to  go  into  further  detail  about  the  possible  ways  in  which  the  Biometry  Data  Bank 
can  be  of  use  in  all  areas  of  the  design  of  equipment  and  making  the  environment  suitable  for  the  requirements  of 
users.  Its  creation  arose  moreover  from  the  extent  of  the  demand  in  this  field  and  the  inability  of  our  Laboratory, 
and  most  certainly  of  many  others,  to  cater  for  this  demand  because  of  the  lack  of  a tool  of  this  type. 

Because  of  its  particular  design  features  the  Data  Bank  is  a very  versatile  and  complete  tool,  capable  of  dealing, 
at  a minimum  cost,  with  numerous  ergonomic  problems  involving  aspects  concerned  with  morphological  measure- 
ments and  variability.  Design  ergonomics  is  impossible  without  biometry. 

Whereas  with  traditional  methods  each  problems  which  arises  results  in  the  specific  anthropometric  study  of  a 
more  or  less  representative  sample  of  a given  population,  the  Biometry  Data  Bank  makes  it  possible  in  the  majority 
of  cases  either  to  avoid  having  to  conduct  further  surveys,  or  to  reduce  considerably  the  scope  of  any  survey  which 
may  be  necessary.  By  exercising  due  care  it  is  possible  to  constitute  representative  samples  from  sub-represented 
elements  taken  from  several  surveys  and  to  transfer  the  results  of  one  biometric  survey  to  another  with  an  insigni- 
ficant risk  of  error,  taking  into  account  the  industrial  requirement  in  question. 

On  the  data  processing  level  the  Human  Biometry  Data  Bank  differs  both  from  the  general  statistical  programs 
and  from  those  of  conventional  data  banks. 

It  has  the  same  capabilities  as  the  general  statistical  programs  from  the  computational  and  validation  of  data 
points  of  view.  It  can  also  update  sets  of  files  and  retrieve  sub-sets  from  several  surveys. 

Compared  with  conventional  data  banks,  its  distinguishing  feature  is  its  simplicity  of  organisation.  The  fact 
that  the  individual  data  items  are  introduced  for  each  survey  avoids  the  creation  of  groups  and  consequently  avoids 
having  to  give  a hierarchial  structure  to  the  various  records  according  to  the  assumed  frequency  of  the  demands.  When 
the  bank  is  interrogated,  the  biometric  data  item  being  sought  is  extracted  by  scanning  all  the  files.  The  response 
time,  depending  on  the  programs  involved,  is  approximately  one  second. 
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5.2  Proposed  Lines  of  Development 

The  following  lines  of  development  are  under  eonsideration: 

- expanding  the  anthropometric  data  base  by  means  of  new  surveys  provided  by  all  the  laboratories  or  those 
possessing  biometric  data  who  wish  to  associate  themselves  with  the  users  of  the  Bank.  So  far,  the  data  now 
being  put  into  the  Bank  represent,  from  310  different  anthropometric  measurements,  a stock  of  1 800  000 
individual  data  items  received  from  various  European  and  American  laboratories  covering  large  and  varied 
human  groups  in  Europe,  Africa  and  America; 

general  extension  to  other  fields  based  on  regular  surveys  of  interest  to  ergonomics:  reaction  to  vibration, 
variability  of  the  stresses  developed  by  operators  at  their  working  positions  etc; 

- development  of  direct  programming  by  users.  A number  of  simple  calculations  can  already  be  requested  direct 
using  the  conversational  mode.  Making  the  calculation  capabilities  more  complex  and  introducing  the  logic 
requirements  should  enable  the  user  to  process  relatively  complex  problems  directly,  without  a great  deal  of 
training; 

- connection  to  a visual  display  unit  is  now  under  consideration.  Indeed,  it  became  clear  during  simulated  tests 
that  many  problems  in  biometry  and  ergonomics,  e,g.  design  of  equipment  and  driving  compartments,  or 
controls  shaped  to  accord  with  anatomical  structures  (hands,  face,  feet  etc),  could  be  solved  with  a con- 
siderable saving  of  time  by  simulation  techniques  and  visual  display  units. 

In  certain  types  of  problems  the  game  of  question  and  immediate  answer  in  the  conversational  mode  facilitates 
progress  and,  after  successive  trial  and  error  evaluations,  finally  provides  a satisfactory  solution. 

Similarly,  the  possibility  of  modelling  on  a screen  all  the  various  choices  which  generally  arise  when  designing 
equipment  because  of  the  continual  need  for  compromises  between  technical  constraints  and  biological  variability 
aspects  already  appears  to  be  a fruitful  line  of  approach  for  further  development. 


THE  VIRTUAL-SYSTEM  CONCEPT 
OF 

NETWORKING  BIBLIOGRAPHIC  INFORMATION  SYSTEMS* 
J.  FRANCIS  REINTJES** 


Over  Che  past  ten  years  Che  number  of  machine-readable  bibliographic  data  bases 
available  Co  professional  comnunicies  has  increased  steadily.  The  current  number  is 
estimated  to  be  in  the  hundreds  and  is  growing  monthly.  Many  of  these  are  searchable 
through  batch  processing  of  search  requests,  but  there  is  a steady  trend  coward  the  on- 
line mode  of  operation.  In  general,  each  commercially  available  online  bibliographic 
storage  and  retrieval  system  offers  several  data  bases  to  its  clientele,  with  each  data 
base  and  each  atorage  and  retrieval  system,  having  its  own  set  of  access  procedures  and 
protocols.  In  the  science  and  technology  domain,  for  example,  Lockheed's  data  bases, 
numbering  approximately  24,  are  accessible  through  their  DIALOG  retrieval  system  which 
is  implemented  on  an  IBM  360/50  in  Palo  Alto,  California;  the  dozen  or  so  data  bases  at 
System  Development  Corporation,  Santa  Monica,  California,  are  on  an  IBM  370/158  and 
available  through  their  ORBIT  system;  the  National  Library  of  Medicine  has  two  implemen- 
tations of  its  Medline  data  bases,  one  at  Bethesda,  Maryland,  the  ocher  at  the  State 
University  of  New  York  at  Albany.  Since  all  these  systems,  as  well  as  many  others  not 
cited,  have  special  access  requirements  and  since  each  data  base  within  each  system  is 
nonstandardized,  a heavy  burden  of  learning  is  placed' on  the  end  user  — the  seeker  of 
information  — in  his  attempts  to  engage  the  system  personally.  Invariably,  in  the 
interest  of  streamlining  the  search  process  and  mcpediting  results,  a professional  inter- 
mediary with  operational  skills  in  the  array  of  available  systems  is  interposed  between 
host  systems  and  seeker;  but  even  the  intermediary  is  finding  it  difficult  to  cope,  as 
Che  nu^er  of  data  bases  and  systems  continue  to  proliferate. 

It  is  generally  agreed  that,  from  a retrieval  effectiveness  viewpoint,  it  is 
highly  desirable  to  allow  the  seeker  of  information  to  engage  the  information  system  him- 
self rather  Chan  to  have  him  work  through  another  person.  Because  of  the  heterogeniety 
chat  presently  exists  among  data  bases  and  systems  that  contain  them,  however,  placing 
Che  seeker  directly  online  is  impractical.  Two  solutions  immediately  come  to  oiind: 

1)  standardize  all  data  bases,  access  procedures  and  the  hardware  and  software  of  the 
host  systems  to  the  fullest  degree  possible;  or  2)  convey  the  impression  of  standardiza- 
tion to  Che  information  seeker  through  use  of  a computerized  interface  translator  inter- 
posed between  end  users  and  the  systems  they  wish  to  access.  The  interface  thus  creates 
a uniform  "virtual"  system,  and  it  is  this  single,  virtual  system  chat  the  user  engages. 
This  paper  reports  upon  research  being  undertaken  on  the  virtual- system  concept. 

The  logical  structure  of  a virtual  system  is  shown  in  Fig.  1.  The  host  retrieval 
systems  with  their  stored  data  bases  are  designated  "SYS  1",*SYS  2",  and  so  forth.  These 
may  be  geographically  dispersed  nationally  or  even  on  a worldwide  basis.  The  end  users 

are  Ui,  U2,  U3  U^.  Instead  of  accessing  the  retrieval  systems  directly  over 

communications  channels,  users  work  through  an  interface  (we  call  it  the  CONIT,  standing 
for  connector  for  Network  information  Transfer).  The  objective  of  CCBUT  is  to  make  the 
network  of  systems  and  their  data  bases  appear  as  nearly  identical  as  practicable  to 
end  users.  CCRIIT's  role  is  to  make  the  necessary  translations  of  commands  and  responses 
and  to  manage  the  information  flow  back  and  forth  between  user  and  the  system  he  is 
engaging,  so  that  the  user  gains  the  impression  he  is  dealing  with  a network  of  identical 
systems. 

To  accomplish  this,  CONIT  logically  has  three  functions  to  perform:  it  interprets 
user  commands  for  the  network  and  represents  individual-system  responses  to  the  users 
in  a uniform  manner;  it  manages  the  many  protocols  that  must  be  dealt  with  either 
locally  at  the  Interface,  or  remotely  as  information  flows  to  and  fro  between  user  and 
host  system;  and  it  provides  the  necessary  translations  of  commands,  messages  and 
protocols  between  CONIT  and  the  individual  host  system  being  accessed. 


The  work  reported  here  was  made  possible  through  grants  SIS74-18165  and  SIS75-22946 
from  the  National  Science  Foundation. 

**  Professor  of  Electrical  Engineering  and  Computer  Science,  Massachusetts  Institute  of 
Technology,  Cambridge,  Massachusetts  02139. 
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The  central  issue  in  configuring  a translational  interface  concerns  the  degree 
of  "virtualness"  that  one  should  expect  to  attain.  Bound  up  in  this  issue,  in  turn,  is 
the  amount  of  computability  — processing  and  storage  — that  one  is  compelled  and 
willing  to  commit  as  overhead  in  order  to  achieve  a given  level  of  virtualness.  From 
the  viewpoint  of  a seeker  of  information  a virtual  system  should  have  several  character- 
istics: 

(1)  The  system  should  be  simple  to  use  and  require  little  or  no  prior  knowledge  of 

procedures  on  the  part  of  the  user.  This  implies  an  online  step-by-step  . 

instructional  mode  which  in  essence  leads  the  user  "by  the  hand"  through  his  ] 

search  session.  It  also  implies  a command  language  that  is  derived  from  the 

user  community's  native  tongue.  Ideally,  the  command  language  should  allow  i 

free-form  natural- language  expressions,  but  for  practical  reasons,  it  will  very  | 

likely  have  to  be  pre- structured.  j 

(2)  For  those  who  are  expert  in  the  use  of  the  network,  a terse  mode  in  which  1 

commands  are  abbreviated  and  instructions  are  held  to  a minimum  should  be  avail- 
able in  order  to  shorten  the  time  required  to  make  requests  and  to  reduce  host-  j I 

system  "chatter."  ^ j 


(3)  As  a minimian,  it  should  be  necessary  for  the  user  only  to  specify  the  host 

system  and  the  data  base  within  the  host  that  he  wants  to  engage.  No  further 
effort  on  the  part  of  the  user  should  be  required  in  order  to  be  linked  to  the 
data  base.  In  other  words,  logins  and  logouts  should  be  automatic.  A more 
sophisticated  version  of  an  interface  should  go  even  further,  since  it  is 
really  information  the  user  Is  seeking,  not  a specific  system  or  data  base. 
Ultimately,  the  user  should  be  able  to  begin  by  making  his  request  for  a 
specific  piece  of  information  and  then  leave  it  to  the  interface  to  select  the 
system  and  data  base(s)  where  the  information  is  likely  to  reside.  One  way  to 
achieve  this  goal  is  through  use  of  a Master  Index  and  Thesaurus  (MAIT)  stored 
at  the  interface.  In  principle,  the  MAIT  would  consist  of  a collection  of  all 
subject-index  terms  and  perhaps  authors  associated  with  all  the  data  bases  in 
the  system  and  the  particular  systems  and  data  bases  that  contain  the  subject 
terms  or  authors'  names.  Upon  receiving  a subject -search  comnand,  the  inter- 
face would  first  consult  the  MAIT  to  determine  where  the  search  should  be  made 
and  then  make  the  connection  to  the  proper  data  base  of  the  proper  system  and 
execute  the  command. 

(A)  Both  the  minimum  and  advanced  performance  requirements  in  (3)  necessitate  that 
all  communications  connections  and  system  login  procedures  including  dial-ups, 
hand-shaking  procedures  and  acknowledgements  be  handled  automatically;  a 
similar  set  of  automatic  procedures  should  also  prevail  for  logouts. 

(5)  Search  sessions  should  be  conducted  in  a single,  common-command  language  that  is 
independent  of  host-system  command  language. 

The  degree  of  virtualness  of  system  responses  that  should  be  built  into  the 
network  is  an  open  question  and  not  resolvable  until  some  experimentation  with  bona  fide 
users  has  been  completed.  It  may  not  be  necessary,  for  example,  to  require  that  easily 
understood  system  responses  be  mapped  into  a standardized  format.  Individual  system 
responses,  although  different  from  one  another,  may  be  easily  understood  in  their 
original  forms. 

Another  open  question  that  requires  study  pertains  to  procedures  for  extracting 
the  totality  of  information  on  a given  search  request  from  the  various  data  bases  in  the 
various  systems  comprising  the  network.  Inevitably,  there  will  be  redundancies.  In 
these  instances  the  question  is:  Should  duplications  be  recognized  automatically  and 

omitted  before  presentation  to  the  user,  or  should  recognition  of  redundancies  be  left 
to  the  user?  Implementation  of  the  automatic  recognition  and  combinatorial  process  is 
not  trivial  in  the  presence  of  nonstandard  formatting. 

Still  another  unresolved  matter,  as  one  attempts  to  get  actual  users  online, 
is  how  to  design  an  effective  set  of  online  instructions  that  will  lead  the  user  to  the 
information  he  wants  without  getting  "struck"  enroute.  A satisfactory  approach  seems  to 
lie  somewhere  between  the  extremes  of  a highly  intelligent  automatic  instructional  mode 
in  which  the  system  is  always  "in  charge"  and  knows  always  the  next  correct  action  to  be 
taken  by  the  user,  and  a hi^ly  simplified  set  of  rigid  instructions  which,  in  effect, 
provides  only  one  road  map  through  the  search  session. 

To  gain  a better  understanding  of  some  of  these  issues,  an  experimental  Interface 
has  been  impleoiented  on  a Honeywell  6180  machine,  referred  to  at  M.I.T.  as  "Multics." 
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The  information  network  Included  In  the  experimental  setup  is  shown  in  Fig.  2.  At 
present  the  experimental  COnnector  for  Information  Transfer  (CONIT)  contains  some,  but 
not  all,  the  features  and  requirements  cited  above.  The  following  are  salient  features 
that  have  been  implemented  to  date. 

* Five  host  bibliographic-information  systems  are  accessible  through  the  CONIT. 

They  are  Lockheed,  Systems  Development  Corporation,  two  Medlines  (one  at  Albany, 
New  York,  the  other  at  Gaithersburg,  Maryland)  and  the  M.I.T.  IBM  System/370, 
which  contains  the  Intrex  data  base. 

it 

Access  to  all  these  systems  (except  the  local  M.I.T.  system)  is  gained  through 
the  Tymshare  communications  network.  Since  Multics  and  the  National  Library  of 
Medicine's  System/370  are  in  the  ARPA  network,  Medline  at  Gaithersburg  can  also  be 
accessed  through  this  network.  In  addition,  there  exists  a network  patch  which 
permits  access  to  the  local  Intrex  data  base  through  a local  ARPA  Terminal 
Interface  Processor  (TIP).  See  Fig.  2. 
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In  its  current  state  of  development  the  user  must  first  specify  to  CONIT  which  |i 

system  he  wants  to  access  and  the  data  base  within  the  system  he  wishes  to  search. 


It  is  possible  to  be  connected  to  any  system  automatically  through  use  of  the  pick 
command.  Similarly,  a data  base  within  the  selected  system  is  activated  automatic- 
ally through  use  of  the  pick  data  command.  Each  command  is  followed  by  an 
argument  which  defines  the  system  or  data  base  desired. 

Some  of  the  coonands  for  searching  a data  base,  manipulating  retrieved  information 
and  printing  output  which  have  been  implemented  thus  far  are  these:  for  making  a 

subject  search  the  find  command,  followed  by  an  appropriate  argument  is  used.  The 
argument  may  pertain  to  document  titles,  one  or  more  fields  of  each  document 
record  or  to  a specific  named  set  of  documents.  The  show  command,  followed  by  an 
appropriate  argument  is  used  to  display  one,  several  or  all  fields  of  a document 
record.  Documents  retrieved  through  use  of  a specific  find  command  may  be  given 
a set  number  for  later  retrieval  as  a set.  The  current  version  of  CONIT, 
however,  is  able  only  to  pass  on  without  alteration  any  Boolean  operations  that  an 
individual  system  can  perform.  CONIT  also  provides  a browsing  command  (show 
index  L cerm 3 ) , which,  for  any  data  base,  provides  a display  of  terms  alphabetical- 
ly near  to  term  in  the  data  base  index  of  terms. 


i . 


An  example  of  an  actual  USER/COfllT  dialog  is  included  at  the  end  of  the  report, 
with  explanatory  coosnents  added  in  the  rectangular  boxes.  Messages  originating  from 
CGNIT  have  a single  vertical  line  beside  them;  messages  that  have  been  translated  (at 
least  in  part)  are  indicated  by  a double  vertical  line;  and  messages  without  any 
markings  have  been  passed  from  the  originating  retrieval  system  to  user  without  a CONIT 
translation. 

Other  commands  not  discussed  above  nor  included  in  Che  sample  dialog  have  also 
been  implemented  and  are  discussed  in  the  reference  at  the  end  of  the  paper. 

From  our  work  thus  far  we  are  satisfied  that  the  virtual -system  concept  of 
networking  heterogeneous  bibliographic  data  bases  is  technologically  feasible.  Further 
study  is  required,  however,  Co  establish  the  cost/benefits  factor,  the  degree  of 
virtualness  Chat  is  needed  in  order  to  provide  satisfactory  service  and  what,  if 
anything,  could  be  accomplished  in  networking  if  software /hardware  changes  were  made  to 
host  systems,  rather  than  leaving  them  unaltered,  as  is  now  being  done. 


Reference ; Marcus,  Richard  S.,  and  Reintjes,  J.  Francis,  The  Networking  of 
Interactive  Bibliographic  Retrieval  Systems,  Report  ESL-R-656, 
March,  1976. 
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SAMPLE  OF  A USER/CCMIT  DIALOG 


con  i t 

Welcome  to  CONIT.  For  help  on  how  to  use  COMIT  you  may  type  'help' 
followed  by  a carriage  return;  otherv/lse,  you  may  now  type  any  CONIT 
command. 

USER;;  

pick  SUny  I n.mir'Am  Cl  i^^ccc  11.1  ATTA^uik.1^  T-in  drxfkTl 


I Attachment  successful. 


INDICATES  SUCCESS  IN  AHACHING  TIP  PORT 


SUNY/MEDUNE; 
Connection  completed. 


INDICATES  PROPER  INITIAL  RESPONSE  SEEN 


II  SS  1 Is  number  for  your  next  search  set. 

, suny  Is  connected  successfully  I INDICATES  LOGIN  COMPLETED  | 

I You  are  now  speaking  In  CONIT  (THESE  MESSAGE^ AND  OTHERS^  COULD  BE  IN 

BEHER,  MORE  VIRTUAL  FORAAAT  AS  DISCUSSED 

II  USER;;  IN  REPORT) 

find  radiation  "" 


SUNY/MEDLINE; 

Your  search  resulted  In  setl  which  contains  this  many  documents;  (268U 
SS  2 Is  number  for  your  next  search  set. 

II  USER;; 

show  title  docsl-3 


II  SUNY/MEOLINE; 


Tl  - ; STATIST  I CAL  EVALUATION  OF  LIGHT  PROTECTION  FACTORS; 


Tl  - PROGNOSIS  AND  POST-THERAPEUTIC  FOLLOW-UP  OF  BREAST  CANCERS  BY 
THERMOGRAPHY. 


Tl  - MICROWAVE  THERMOGRAPHY;  A METHOD  OF  DETECTING  SUBSURFACE  THERMAL 
PATTERNS. 

SS  2 Is  number  for  your  next  search  set. 


pfck'lms  f;<y:KHEED  DIALOG^SYgTEM  PICKED 
I sen!  INDICATES  LOGIN  SENT  j 

MOST  IS  ONLINE 

MMMIf  RECONNECT  @ 15;53;05 


MEW  FILE  - DISSERTATION  ABS.  - SEE  7NEWS 


II  USER;; 

pick  data  eric 


run 


SAMPLE  OF  A USER/C«ilT  DIALOG  - continued 
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File  reset:  ERIC  FULL-TEXT  ED  & EJ  DEC-7 
USER: : 

find  radiation 


Your  search  resulted  In  set2  454  RADIATION  (PROCESS  OF  ENERGY 


USER: : 

show  docsQshow  title  docsl-3 


Msg  from  1: DIALOG  GOING  DOWN  FOR  THE  DAY 
IM  5 MINUTES 

1 

EJ121G00 

Solar  Heated  Homes:  They're  Here 

2 

EJ121508 

A Course  in  Nuclear  Radiation  for  All  High  School  Students 
5 

EJ121506 

Energy  Alternatives 


USER: : 
pick  sdc 
sent 

YOU  ARE  ON  LINE  L8A 

HELLO  FROM  SDC/ORBIT. 

YOU  ARE  NOW  CONNECTED  TO  THE  ORBIT  DATABASE. 


|SDC  ORBIT  SYSTEM  PICKED 


@ CANCELS  LINE 
TO  THAT  POINT 


NOTE  SPECIAL  AND  IMPORTANT 
MESSAGE  IN  LOGIN 


<■ 


f 


I 


:;1 


SDC/ORBIT: 


TODAY  ONLY:  GEOREF  AND  APIPAT  WILL  NOT 
FOR  THE  INCONVENIENCE. 

• ••* 


BE  AVAILABLE.  SORRY 
I NOTE  SPECIAL  MESSAGE 


SS  1 Is  the  number  of  your  next  SDC/ORBIT  search  set. 


USER:: 

pick  data  ntls 


r i 


i I 
i ■ 
> 


r 


SDC/ORBIT: 

THE  TIME  IS  NOW  6 :56  P.M.  (EST),  01/16/76 
YOU  ARE  NOW  CONNECTED  TO  THE  NTIS  DATABASE. 

SS  1 Is  the  number  of  your  next  SDC/ORBIT  search  set. 


■ 1 
d 


USER:: 

find  radiation 


SAMPLE  OP  A USER/COHIT  DIALOG  - contlnutd 


SOC/ORBIT: 

SS  1 is  resulting  set  containing  this  many  documents: (21223) 
SS  2 Is  the  number  of  your  next  SDC/ORBIT  search  set. 

USER: : 

show  title  docsl-3 


SDC/ORBIT: 

Tl  - Results  of  the  Lyman  alpha  Measurements  of  the  Satellite  Dial 
Ergebnisse  der  Lyman-alpha  Messungen  des  Satelllten  Dial 

Tl  - Materials  Data  Retrieval  at  Estec 

Tl  - Angular  Distributions  of  Electrons  of  Energy  Esub  E Greater  Than 
0.06  Mev  In  the  J ovian  Magnetosphere 

SS  2 Is  the  number  of  your  next  SDC/ORBIT  search  set. 

USER:: 
pick  nim 
sent 

'logon  in  PROGRESS  AT  18:59:30  ON  JANUARY  16,  1976 
•**  BACK72  AT  NLM  AND  SUNY  CONTAINS  1972  CITATIONS  ONLY.*** 

TOTAL  ACTIVE  TSO  USERS:  06 
READY 


TSO  LINE  07E  , 

ELHILL3  IS  NOT  AVAILABLE  AT  THIS  TIME.  I NOTE  IMPORTANT  MESSAGE 
READY  ' 


USER: : 

pick  suny 

Can't  log  off  nim 


USER: : 
sp  host 

Transparent  mode.  All  of  your  Input  will  now  be  sent  directly  to  nim  without 
Interpretation, except  the  'speak  conIt'  command  wMch  will  cause  COMIT  to 
resume  Interpreting  your  requests. 

USER: : 
logoff 


CONIT  DOES  NOT  YET  RECOGNIZE  UNAVAILABILITY 
MESSAGE  AND  GETS  CONFUSEtt  ANALYST  GOES 
INTO  TRANSPARENT  MODE  TO  CARRY  OUT  TASK 
■MANUALLY  CVITHOUT  TRANSLATION) 


LOGGED  OFF  TSO  AT  19:03:37  ON  JANUARY  16,  1976+ 

DROPPED  BY  HOST  SYSTEM 
PLEASE  LOG  IN: 

USER:: 

sunyU 


PASSWORD: ben 


THE  NATIONAL  STANDARD  REFERENCE  DATA  SYSTEM 


Stephen  A.  Rossmassler 
Office  of  Standard  Reference  Data 
National  Bureau  of  Standards 
Washington,  DC  20234 


The  National  Standard  Reference  Data  System  Is  a coordinated,  but  decentralized,  effort  to  Increase 
the  reliability  and  availability  of  numerical  data  used  In  and  produced  by  the  physical  sciences  and 
engineering.  Individual  data  projects  on  specific  technical  subjects  are  established  to  extract,  evaluate, 
and  compile.  In  a systematic  manner,  all  relevant  data  from  the  scientific  journal  and  technical  report 
literature.  The  evaluation  process  compresses  the  original  data,  and  the  systematic  treatment  aids  the 
user  In  filling  his  data  needs.  Sophisticated  data-handling  capabilities  — Including  on-line  Information 
and  data  retrieval  — are  developed  In  Individual  data  centers  and  also  In  a central  data  systems  design 
group. 


Introduction 


A major  reason  for  the  effectiveness  of  scientific  research  and  development  1s  the  fact  that  today's 
scientists  and  engineers  can  build  on  what  was  done  yesterday,  and  last  year,  and  ten  years  ago.  The 
results  of  past  scientific  work,  carefully  reported,  appropriately  published,  are  a basic  part  of  the 
Intellectual  equipment  of  the  technical  worker.  This  Is  particularly  true  of  the  numerical  results  of 
scientific  observation,  the  quantitative  answers  which  Nature  has  given  to  our  questions.  We  call  these 
nunters  "data".  Data,  combined  with  the  technical  understanding  which  comes  from  training  and  experience, 
are  the  tools  of  the  scientist  and  engineer. 

But  a tool  is  useful  only  if  it  is  reliable,  and  at  hand  to  do  the  job.  Data  must  be  accurate  and 
available  If  good  results  are  to  be  achieved. 

Such  data  are  the  concern  and  the  reason  for  the  existence  of  the  National  Standard  Reference  Data 
System  (NSRDS).  But  why  a special  activity,  since  many  scientific  journals  publish,  abstract,  and  index 
a great  volume  of  literature  each  year?  What  more  is  needed?  From  the  user's  viewpoint,  much  more  Is 
needed.  The  very  bulk  of  the  scientific  literature  poses  great  problems  for  the  data-seeker.  Even  a 
carefully  worded  Inquiry  may  result  in  more  literature  citations  than  he  can  read  and  assimilate.  And 
when  he  has  combed  through  fifty  (or  five  hundred)  papers  and  extracted  the  relevant  data,  how  can  he  pick 
the  best  values?  The  published  values  for  the  thermal  conductivity  of  copper,  for  example,  range  widely. 
Not  all  of  the  numbers  can  possibly  be  right.  The  answer  is  to  involve  the  expert,  and  to  encourage  him 
to  evaluate  the  data  in  his  own  field  of  special  competence.  Such  a program  provides  reliable  data.  At 
the  same  time,  because  inaccurate  or  poorly  documented  data  are  weeded  out,  and  inconsequential  papers  are 
discarded,  data  evaluation  compresses  the  literature,  often  by  a factor  of  more  than  100,  sometimes  by 
a factor  of  1000.  In  this  way  the  data  are  also  made  much  more  accessible. 

The  NSRDS,  as  a technical  information  system,  offers  a number  of  retrieval  concepts  which  warrant 
your  attention  today.  The  first  feature  is  the  one  I have  just  mentioned  — the  process  of  systematic 
thorough  screening  and  evaluation  of  the  original  literature  does  much  to  facilitate  data  retrieval  by 
the  user.  At  the  same  time,  each  data  compilation  desirably  contains  citations  to  the  original  sources. 
Thus,  for  the  scholar,  both  the  expert  and  the  non-expert,  data  compilations  are  an  excellent  point  of 
entry  into  the  literature  on  each  special  subject  covered. 

The  second  feature  of  interest  is  the  information-handling  capabilities  which  the  Individual  special- 
ized data  centers  develop  In  order  to  do  their  jobs  efficiently. 

The  third  feature  involves  the  general-purpose  data-manipulation  procedures  developed  within  the 
NSRDS  for  use  by  data  centers  and  those  who  use  their  products.  These  two  latter  features  will  be 
described  in  more  detail  later  in  this  presentation. 

History  and  Mission 

Numerical  data  conpilation  and  evaluation,  as  a valid  part  of  science,  have  a long  history,  full  of 
notable  contributions.  The  Periodic  Table  of  Mendeleev,  familiar  to  freshman  chemistry  and  physics 
students.  Is  a good  example.  Other  early  contributions  Include  the  Landolt-Bomsteln  Tables  (which  are 
still  being  updated  and  published)  and  many  others.  The  National  Bureau  of  Standards  has  taken  part  in 
this  work  for  over  half  a century.  The  editor  of  the  International  Critical  Tables,  Dr.  Edward  Washburn, 
was  a member  of  the  NBS  staff. 

While  such  data  evaluation  projects  were  highly  valuable,  they  lacked  a central  focus.  There  was  no 
responsibility  for  planning,  or  for  setting  of  priorities.  To  remedy  these  deficiencies,  the  Federal 
Council  for  Science  and  Technology,  in  1963,  enunciated  a Federal  Policy  that  there  be  a "National  Standard 
Reference  Data  System."  The  National  Bureau  of  Standards  was  given  responsibility  for  the  administration 
of  the  system,  and  for  the  coordination  of  the  data  programs  of  other  Federal  agencies.  A small  program 
management  office,  the  Office  of  Standard  Reference  Data  (OSRD),  was  established  for  this  purpose. 

This  policy  was  augmented.  In  1968,  by  the  Standard  Reference  Data  Act,  Public  Law  90-396,  which 
expressed  Congressional  endorsement  of  the  general  undertaking.  The  Act  also  authorized  the  Secretary  of 
Coimerce  to  secure  copyright  on  standard  reference  data  compilations,  and  to  sell  them  as  part  of  a cost- 
recovery  program.  The  copyright  authority  Is  not  unique  among  Federal  agencies,  but  it  is  unusual. 
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The  general  mission  of  the  NSRDS  Is  defined  In  these  two  documents  — the  1963  Policy,  and  the  1968 
legislation.  Refinements  and  added  details  have  emerged  with  twelve  years  of  operation.  Basically  there 
are  five  aspects  to  this  mission,  as  follows: 

1.  To  plan,  fund,  and  administer  a program  of  data  evaluation  In  the  physical  sciences; 

2.  To  publish  compilations  of  data; 

3.  To  coordinate  data  programs  nationally; 

4.  To  provide  a focus  for  Interaction  with  similar  programs  of  other  countries,  and  with  Inter- 
national efforts;  and 

5.  To  Improve  the  quality  of  laboratory  measurements  through  provision  of  benchmark  data, 
standards,  criteria,  and  the  like. 

Operational  Features 

The  data  evaluation  program  Is  decentralized.  In  order  to  Involve  the  participation  of  leading 
experts,  wherever  they  work,  the  actual  compilation  and  evaluation  takes  place  In  many  widely  separated 
groups.  The  fact  that  these  groups  vary  greatly  In  size,  organizational  structure,  and  source  of  finan- 
cial support  leads  to  a certain  degree  of  diffuseness  in  the  system.  However,  It  Is  Instructive  to 
recognize  two  broad  types  of  components.  The  first  Is  the  continuing  data  center,  which  has  a charter 
(formal  or  Informal)  to  cover  a certain  technical  area  on  a continuing  basis.  Ideally,  such  centers  have 
an  assurance  of  stable  long-term  financial  support.  The  task  of  a data  center  Is  to  search  the  world 
literature  on  a regular  basis,  to  retrieve  and  Index  papers  within  Its  scope  of  Interest,  to  extract  the 
numerical  data,  and  to  carry  out  critical  evaluation  leading  to  publication  of  tables  or  reviews.  Such 
centers  are  normally  located  In  the  environment  of  a scientific  laboratory.  Many  of  the  centers  are 
highly  mechanized  and  have  large,  computer-based  files  from  which  bibliographic  citations  and  data  can  be 
retrieved.  Generally,  they  are  able  to  respond  to  requests  for  specific  Information  from  the  scientific 
public. 

The  other  type  of  component  is  the  Individual  scientist  (or  small  group  of  collaborators)  who  pro- 
duces a "one-shot"  compilation  or  critical  review  as  a part  of  what  he  regards  as  his  normal  scientific 
activity.  Many  valuable  data  compilations  have  been  produced  In  this  way.  Such  Individuals  do  not  con- 
sider themselves  part  of  a formal  data  center,  and  there  Is  generally  no  commitment  for  continuity  or 
updating.  The  rapid  growth  of  the  scientific  literature  makes  It  increasingly  difficult  for  an  Individual 
to  do  this  type  of  compilation.  However,  the  continuing  data  centers  can  serve  a useful  function  by  ! 

providing  bibliographic  back-up  for  Individual  scientists  In  other  locations  who  wish  to  write  critical 
reviews  or  do  critical  compilations  of  limited  scope. 

In  the  pattern  which  has  emerged.  It  is  clear  that  both  types  of  components  are  essential  for  j 

successful  operation. 

One  of  the  primary  goals  of  the  Office  of  Standard  Reference  Data  Is  to  establish  continuing  data  ] 

centers  in  all  technical  areas  which  fall  within  the  scope  of  the  program.  Since  the  resources  of  OSRD  1 

are  limited,  every  effort  Is  made  to  encourage  other  Federal  agencies  and  private  organizations  to  par-  j 

t1 cl  pate  In  the  support  of  these  activities.  j 

i 

In  the  period  since  the  establishment  of  NSRDS,  data  evaluation  activities  In  other  parts  of  the 
world  have  Increased  considerably.  Formal  governmental  programs  similar  to  NSRDS  have  been  established 
In  several  countries.  In  the  U.  K. , the  Science  Research  Council  administers  a program  which  Includes 
the  support  of  a number  of  data  centers  In  the  physical  sciences.  In  the  Soviet  Union,  the  Academy  of 
Sciences  supports  several  data  evaluation  projects,  and  the  State  Service  for  Standard  Reference  Data 
(GSSSD)  has  responsibilities  similar  to  those  of  NSRDS  for  scientific  and  technical  data.  The  Office  of 
Standard  Reference  Data  maintains  liaison  with  these  groups  with  the  aim  of  avoiding  duplication  In  data 
compilation  projects  and  promoting  maximum  compatibility  of  output.  To  effect  this  liaison,  proposals 
requesting  support  of  projects  In  subject  areas  of  mutual  concern  are  exchanged  for  comment. 

The  establishment  of  the  Committee  on  Data  for  Science  and  Technology  (CODATA)  In  1966  has  provided 
a formal  framework  for  International  cooperation.  CODATA  Is  a committee  of  the  International  Council  of 
Scientific  Unions  (ICSU).  The  Secretariat  Is  located  In  Paris.  The  main  purpose  of  CODATA  Is  to  encour- 
age, on  a world-wide  basis,  the  production  and  distribution  of  critically  evaluated  numerical  data. 

Since  the  entire  thrust  of  the  NSRDS  Is  user-oriented,  much  attention  Is  given  to  Interactions  with 
users.  Such  Interactions  fall  Into  three  general  categories: 

1.  Data  products  and  services.  These  are  discussed  In  detail  below. 

2.  Feedback  mechanisms.  Including  surveys.  Interviews,  questionnaires,  symposia,  and  correspondence. 

3.  Advisory  groups  representing  special  user  groups,  specific  scientific  discipline  Interests,  etc. 

Products  and  Services 

Most  of  the  output  of  the  NSRDS  consists  of  compilations  of  evaluated  data  and  critical  reviews  of 
the  state  of  quantitative  knowledge,  each  covering  a specific  (usually  quite  narrow)  subject.  These 
products  are  published  through  a variety  of  channels.  Including  the  following: 

Journal  of  Physical  and  Chemical  Reference  Data.  A quarterly  Journal  containing  data  compilations 

and  critical  data  reviews,  published  for  the  National  Bureau  of  Standards  by  the  American  Institute 

of  Physics  and  the  American  Chemical  Society. 

NSRDS-NBS  Series.  A publication  series  distributed  by  the  Superintendent  of  Documents,  U.S.  Govem- 

ment  I’rinting  Offi  ce , 
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Appropriate  publications  of  tecbnical  societies  and  connerclal  publishers.  Handbook  publications 

are  a particularly  Inportant  element  of  this  category. 

A recapitulation  of  the  output  of  ten  years  of  program  operation  showed  160  compilations,  covering 
36,000  pages  and  over  30,000  materials. 

Recently  the  Office  of  Standard  Reference  Data  has  begun  to  disseminate  some  large  data  files  In  the 
form  of  magnetic  tapes.  These  are  sold  through  the  National  Technical  Information  Service.  At  present, 
three  data  tapes  are  available,  on  mass  spectral  data,  atomic  spectra,  and  crystallographic  data. 

Going  beyond  publications,  an  important  category  of  Information  and  data  services  1s  the  answering 
of  Inguiries.  Both  the  OSRD  and  its  affiliated  data  centers  (about  20  In  number)  attempt  to  answer, 
within  restrictions  Imposed  by  limited  staff  and  time.  Inquiries  concerning  data  in  the  physical  sciences 
and  engineering.  Information  and  data  are  provided  from  NSRDS  publications  and  other  available  sources. 
At  times  references  are  provided;  If  available  and  appropriate,  copies  of  publications  or  excerpts  there- 
from containing  the  requested  data  are  furnished.  OSRO  normally  receives  about  1000  requests  and  in- 
quiries per  year.  Under  a recently  established  cost-recoveiy  program,  custom  services  which  require  a 
substantial  amount  of  time  are  billed  to  the  requestor. 

Information  Retrieval  Technology 

It  has  already  been  pointed  out  that  the  process  of  data  compilation  and  evaluation  Improves  the 
accessibility  of  data,  as  compared  with  their  status  In  the  original  journal  and  report  literature.  The 
improvement  takes  place  In  several  ways.  First,  the  primary  publication  usually  focuses  on  a research 
or  development  project  of  which  data-gathering  Is  only  a part.  Data  results  are  frequently  buried  In 
the  text,  and  are  not  Identified  in  the  abstract.  Systematic  compilation  leads  to  greater  visibility, 
and  republication  takes  place  through  a medium  explicitly  devoted  to  data. 

Second,  data  tabulation  Involves  eliminating  many  of  the  non-data  aspects  of  the  original  paper. 

The  effect  Is  a major  reduction  in  volume,  usually  more  than  100-fold.  For  example.  In  a recent  issue 
of  the  Journal  of  Physical  and  Chemical  Reference  Data,  an  article  on  "Critical  Region  Parameters"  cited 
and  extracted  data  from  162  references;  another  on  "Diffusion  In  Copper  and  Copper  Alloys"  derived  data 
from  484  references.  Such  compaction  is  very  helpful  to  the  data-seeking  user  In  his  retrieval. 

Third,  the  user  gains  greatly  because  the  data  compiler  makes  a commitment  to  cover  the  literature 
comprehensively.  The  latter's  literature  search  is  extensive  and  systematic.  While  no  compiler  claims 
perfection  In  coverage  of  sources,  the  compilation  user  Is  afforded  the  benefits  of  a search  In  which  an 
acknowledged  expert  tried  to  do  the  job  completely.  Many  data  centers  feel  that  they  probably  capture 
close  to  90X  of  the  pertinent  literature. 

Beyond  this  Inherent  Improvement  In  retrieval  capability,  the  NSRDS  works  both  in  its  data  centers 
and  centrally  to  achieve  Improvements  In  Information  retrieval  technology. 

Data  centers  have  a strong  selfish  Interest  In  such  Improvements.  They  have  a heavy  work  load,  a 
responsibility  to  search  both  the  past  and  current  publications  within  their  subject  scope,  and  the 
difficult  scientific  task  of  data  evaluation.  Accordingly,  they  develop  advanced  and  highly  effective, 
though  often  very  specialized.  Information  retrieval  systems.  Typical  features  of  such  systems  are  the 
following: 

1.  Reabstracting  of  the  original  literature,  with  extraction  or  annotation  and  filing  of  data 
content. 

2.  Systematic  Indexing  of  data  contents  for  computer  searching. 

3.  Magnetic  card  or  punch  paper  tape  recording  of  data  files,  to  reduce  retyping. 

4.  Fixed-field  formats  for  in-house  records  to  assist  In  retrieval. 

5.  Automated  preparation  of  bibliographies  as  a spin-off  of  In-house  records. 

6.  Final  publication  via  computerized  typesetting  from  In-house  files. 

More  broadly  applicable  are  the  data  handling,  manipulation,  and  retrieval  capabilities  developed  by 
the  OSRD  Data  Systems  Design  Group.  The  capabilities  have  recently  been  combined  into  a general-purpose 
data  management  system  called  Omnidata.  This  system  has  provision  for:  Information  and  data  retrieval; 

numerical,  statistical,  and  graphical  analysis;  file  definition  and  updating;  encoding  and  decoding  of 
data  fields;  multi-key  sorting;  arithmetic  operations  and  general  file  manipulation.  Going  beyond  the 
normally  available  routines  for  searching,  arithmetic  operation,  statistical  analysis,  and  report  genera- 
tion, Omnidata  offers  36  modules.  Including  such  user-oriented  procedures  as  Browse,  Rename,  Graph,  and 
Tally,  as  well  as  file-building  procedures,  such  as  Check  Sum,  Screen,  and  Update. 

A detailed  description  of  this  system  will  be  available  In  a forthcoming  NBS  Handbook,  "Omnidata: 

A Computer  Program  for  Data  Retrieval,  Statistical  and  Graphical  Analysis,  and  Data  Management,"  by 
Hllsenrath  and  Breen. 

Those  who  may  want  a more  comprehensive  overview  of  the  National  Standard  Reference  Data  System  are 
referred  to  a recent  version  of  th.'  status  report  "Critical  Evaluation  of  Data  In  the  Physical  Sciences," 
NBS  Technical  Note  881.  The  report  contains  a list  of  data  projects  and  data  centers,  a publication 
list,  and  other  details.  It  Is  available  from  the  Office  of  Standard  Reference  Data,  or  from  the  Govern- 
ment Printing  Office  as  SO  Catalog  No.  C 13.46.881.  The  price  Is  $1.10,  plus  25X  postage  outside  the 
United  States. 
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Synthesis  and  Distribution  of  Envlronnental  Satellite  Data 


James  I.  Vette.  National  Space  Science  Data  Center 
Code  601.  Goddard  Space  Flight  Center 
Greenbelt,  Nary land.  20771 


Suinnary 

The  activities  of  two  National  Oceanographic  and  Atmospheric  Administration  and  two  National  Aero- 
nautics and  Space  Administration  facilities  Involved  In  the  synthesis  or  distribution  of  space 
environmental  data  are  reviewed.  The  data  products,  user  services,  and  pertinent  publications  are  given. 
The  computer  systems  that  support  three  of  these  facilities  are  discussed  and  some  details  of  the  synthesis 
procedures  are  given.  Data  from  the  following  satellite  series  are  Included  In  the  discussion:  TIROS, 
NIMBUS.  ESSA,  NOAA,  SMS/GOES,  ATS,  SOLRAD.  Orbital  position  data  for  HAWKEYE  1,  IMP  HW,  VELA  SB,  PIONEERS 
6-9,  PIONEER  11,  HELIOS  1 & 2,  and  SOLRAD  llA  t IIB  In  several  coordinate  systems  are  discussed. 
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1 Introduction 

The  use  of  earth-orbiting  spacecraft  to  obtain  environmental  data  on  a nearly  global  basis  has  Increased 
tremendously  In  the  past  decade.  In  this  same  period,  the  growing  sophistication  of  flight  Instruments  and 
Increased  data  rates  require  advanced  techniques  to  synthesize  and  distribute  effective  data  products  to  the 
various  users.  To  detail  all  the  spacecraft  programs  acquiring  and  processing  environmental  data  Is  beyond 
the  scope  of  this  paper.  However,  to  Indicate  some  of  the  techniques  that  are  being  employed  and  to  provide 
an  Idea  of  the  types  of  environmental  data  available,  specific  examples  have  been  chosen  for  discussion. 

These  encompass  meteorological,  earth  resources,  near-earth  charged  particle,  solar  electromagnetic,  and 
orbital  position  data.  The  techniques  employed  Involve  Image  processing,  digital  data  processing  and 
compression,  photographic  processing,  interactive  computer  graphics,  and  computer  generated  photographs. 

The  distribution  ranges  from  on-line  systems  with  real  time  data  bases  to  synthesized  products  which  are 
available  upon  request  by  mall  or  telephone. 

The  activity  of  four  separate  facilities  will  be  presented.  The  Satellite  Data  Services  Branch  (SDSB) 
of  the  Environmental  Data  Services'  (EOS)  National  Climatic  Center  (NCC)  of  the  National  Oceanographic  and 
Atmospheric  Administration  (NOAA)  provides  environmental  and  earth  resources  satellite  data  to  users  once  j 

the  weather  forecasting  services  have  been  provided.  Emphasis  will  be  placed  on  meteorological  data  from 
the  NOAA  operational  series  (known  as  ITOS  before  launch)  and  the  geostationary  spacecraft  series  SMS/GOES. 

The  Space  Environment  Laboratory  (SEL)  Data  Acquisition  and  Display  System  (DAOS)  Is  a real-time  solar-  ( 

terrestrial  environment  data  system  operated  by  SEL  of  NOAA's  Environmental  Research  Laboratories.  Data 
on  solar  x-rays,  charged  particles,  and  vector  magnetic  fields  are  obtained  from  SMS-1,  SMS-2,  and  GOES-1 
satellites  in  real  time.  Other  spacecraft  and  ground-based  measurements  are  available  through  the  system. 

Two  National  Aeronautics  and  Space  Administration  (NASA)  facilities,  the  Atmospheric  and  Oceanographic 
Information  Processing  System  (AOIPS)  and  the  International  Magnetospheric  Study  (IMS)  Satellite  Situation 
Center  (SSC),  are  both  located  at  the  Goddard  Space  Flight  Center  (GSFC).  The  AOIPS  Is  used  to  process 
LANDSAT  data  for  land  use,  hydrological,  oceanographic,  and  geological  studies  and  SHS/GOES  data  for  mete- 
orological research.  The  IMS/SSC  processes  both  predicted  and  achieved  orbital  position  data  for  a number 
of  satellites  In  a variety  of  coordinate  systems  and  determines  Interesting  confluences  of  this  ensemble 
of  spacecraft. 

The  thrust  of  this  paper  Is  to  provide  some  Insight  into  the  work  being  accomplished  at  these  facil- 
ities and  to  Indicate  the  distribution  of  the  resulting  data  products.  It  Is  not  surprising  that  computers 
play  a vital  role  In  these  activities;  however.  It  may  be  unexpected  that  three  of  these  facilities  rely 
heavily  on  mini -computers. 


II  The  Satellite  Data  Services  Branch 

This  facility  Is  collocated  with  the  operations  center  of  NOAA's  National  Environmental  Satellitt 
Service  (NESS)  In  the  World  Weather  Building,  5200  Auth  Road,  Washington,  D.C. , 20233,  and  serves  as  a 
data  source  for  (a)  environmental  data  from  TIROS,  NIMBUS,  ATS  (I  & III  only),  ESSA,  NOAA,  and  SMS/GOES 
' spacecraft,  (b)  earth  resources  data  from  LANDSAT  spacecraft,  and  (c)  photographs  of  the  earth  from 

I SKYLAB  missions.  A more  detailed  summary  of  their  holdings  and  services  has  been  published  recently 

f (Reference  1). 

I The  visible  light  and  Infrared  data  taken  by  a series  of  Improving  experiments  on  the  spacecraft 

’ listed  In  (a)  have  been  processed  to  produce  photographs.  In  addition,  the  digital  data  from  many  of 

the  later  experiments  are  available  on  magnetic  tape  since  each  photographic  step  tends  to  degrade  the 
data  and  many  users  wish  to  employ  special  digital  processing.  Currently  SDSB  receives  dally  about  250 
i negatives  from  the  polar-orbiting  NOAA  spacecraft  and  about  250  negatives  of  full-disc  and  sector  Images 

from  the  SMS/GOES  spacecraft.  The  processing  of  the  original  data  are  done  by  NESS  and  the  final  pro- 
ducts  are  provided  to  SDSB  for  distribution. 

The  geostationary  spacecraft  provide  full-disc  and  sector  Images  from  the  Visible  and  Infrared  Spln- 
;t  Scan  Radiometer  (VISSR).  The  visible  full-disc  photographs  are  produced  with  a two  mile  resolution  and 

; the  sector  Images  with  a one  mile  resolution.  These  products  are  Important  for  studying  global  weather 

’ systems  and  provide  the  evolution  and  motion  ofcload  patterns.  An  example  of  the  full-disc  Image  is  shown 

In  Fig.  1.  Computer  generated  hemispherical  composites  from  the  lower  altitude  NOAA  satellites  are  produced 


f 


In  polar  stereographic  projections  qridded  with  circles  of  latitude  and  rays  of  longitude;  an  example 
is  shown  in  Fig.  2.  Mercator-projection  composites  for  middle  and  low  altitudes  are  also  9enerated 
with  a rectangular  latitude-longitude  grid  (Fig.  3).  Infrared  data  are  used  to  generate  both  nighttime 
and  daytime  composites,  while  visible  data  are  only  useful  for  daytime.  The  composites  are  produced  by 
the  Scanning  Radiometer  (SR)  on  the  NOAA  satellites;  additional  data  are  obtained  from  the  Very  High 
Resolution  Radiometer  on  board  NOAA  3 and  4,  which  has  1 Km  resolution  at  the  nadir  for  both  visible  and 
infrared  channels.  A detailed  description  of  the  procedures  used  to  produce  the  digitized  mosaics  is 
available  (Reference  2). 


Fig.  2.  Computer-generated  Northern  Hemisphere 
mosaic  photograph.  This  composite  was  generated 
from  the  NOAA-2  Scanning  Radiometer  visible  channel 
(0.52-0.73  urn)  data  acquired  on  12  orbits  during 
August  25-26,  1974.  The  GMT  times  mark  the  equa- 
torial crossing  grid  so  that  nominal  time  for  each 
swath  can  be  estimated.  The  satellite  is  in  a sun- 
synchronous  orbit  at  an  altitude  of  1450  Km  with  an 
inclination  of  101.7°.  Using  the  Greenwich  meridian 
one  can  determine  the  local  time  for  each  daytime 
swath  is  approximately  9-9:30  a.m.  Resolution  is 
4 Km  at  the  nadir  and  about  10  Km  at  1600  Km  to 
either  side  of  the  nadir. 


Fig.  1.  Full-disc  visible  photograph  of  the  earth. 
This  picture  was  taken  by  the  VISSR  on  SMS-1  on 
July  17,  1975  at  1700  UT  with  the  visible  channel 
(0.55-0.75  urn).  The  sub-satellite  point  was 
1.5°S,  75.0°W.  The  instrument  also  has  an 
infrared  channel  at  10.5-12.5  ym.  The  gridding 
and  landmarking  were  done  by  computer  during  the 
picture  construction  at  NESS.  The  instrument  is 
capable  of  0.9  Km  resolution  at  the  nadir  but 
this  product  has  about  3 Km  resolution.  Approx- 
imately 18  minutes  scanning  time  are  required 
for  the  instrument  to  obtain  the  data  to  produce 
this  picture. 


The  data  formats  for  the  various  products  include  photoprints  (usually  10"  x 10"  on  matte  paper), 
negatives,  transparencies,  positive  or  negative  35  mm  reels,  slides,  and  movie  loops  of  changing  weather 
patterns.  A Catalogue  of  Operational  Satellite  Products  has  been  published  (Reference  3).  EDS  publishes 
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Fig.  3.  Mercator-projection  mosaic  photograph.  This  composite  was  generated  from  the  NOAA-2  Scanning 
Radiometer  infrared  channel  (10.5-12.5  pm)  data  acquired  during  August  25,  1974.  The  numerous  stonn 
systems  at  low  northern  latitudes  are  clearly  seen.  Resolution  is  half  as  great  as  for  the  visible 
channel  shown  in  Fig.  2. 


"Key  to  Meteorological  Records  Documentation  No.  S.4  (Environmental  Satellite  Imageryfas  a monthly  Issue 
describing  data  available  from  NOAA  satellites  which  Includes  5“  diameter  dally  mosaics  (visible  and 
Infrared)  for  both  Northern  and  Southern  Hemispheres. 


Ill  SELDAOS 


SELDAOS  Is  a real-time  solar-terrestrial  environment  data  system  operated  by  SEL  located  at  325  South 
Broadway.  Boulder,  Colorado  80302.  A detailed  description  of  the  system  has  been  given  recently  by  Williams 
(Reference  4).  The  general  concept  Is  displayed  In  Fig.  4.  A variety  of  experiments,  both  ground-based  and 
satellite,  furnish  data  to  the  system. 

The  satellite  data  provide  measurements  of  ■ SeliM  W -if'  Ciaund  kaM  ~ I 

solar  x-rays,  solar  wind  plasma,  electrons,  pro-  // 

tons,  alpha  particles,  and  vector  magnetic  fields  SMS  I SMS  r iTOk  khiiaohi 

at  synchronous  altitudes.  More  details  of  these  // 

experiments  are  presented  In  Table  1.  The  data  “**  '' 

from  SMS/GOES  are  collected  directly  In  real  time  // 

while  the  data  from  the  other  spacecraft  are  sent  .. 

through  a communication  system  made  up  of  the  / //  / 

USAF  Astrophysical  Tel ecomnunl cations  Network  t \ i \ II  ^ — ' 

(ATN),  a USN  commercial  line,  other  dedicated  \ \ / # A 

conmerclal  lines,  a dedicated  meteorological  line  \ \ / //  ? jj^m 

to  Moscow,  and  the  International  Ursigram  World  \ \ II  > f 

Days  Service  (lUWOS)  network.  The  SMS/GOES  data  \ \ n ! / 

are  converted  Immediately,  processed,  recorded  on  \ I " / / 

magnetic  tape,  and  Inserted  Into  the  real  time  ^ \ / * / / 

data  system.  The  data  base  consists  of  one  \ / ' / / 

minute  averages  for  the  most  recent  four  days  and  \ \ / j J 

five  minute  averages  for  32  days.  Data  purged 

from  the  data  base  are  placed  on  magnetic  tape  /f~\. 

and  archived  at  the  National  Geophysics  and  Space  ^ • H c— — I* 

Data  Center  (N6SDC)/World  Data  Center  (WDC)-A  for  TL. 

Solar  Terrestrial  Physics  In  Boulder.  Such  tapes 

will  also  be  available  from  the  National  Space  ___ 

Science  Data  Center/WDC-A  for  Rockets  and  Satel-  ** 

lltes  at  GSFC.  H 


Near  real-time  data  of  polar  cap  solar 
proton  fluxes  are  received  from  a Solar  Proton 
Monitor  on  board  NOAA  spacecraft.  These  data 
arrive  at  SEL  through  the  ATN  and  are  Inserted 
Into  the  on-line  data  base.  Fluxes  observed 
across  the  polar  caps  are  maintained  for  the 
most  recent  eight  days,  with  polar  cap  averages 
being  maintained  for  one  month  before  being  sent 
to  the  archive. 
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The  data  from  SOLRAD  11A  or  11B  Is  processed 
by  the  Naval  Research  Laboratory  and  sent  over 
the  USN  line.  Priority  Is  given  to  the  satellite 
that  Is  In  the  Interplanetary  medium;  ^nce  these 
two  satellites  are  maintained  about  180°  apart, 
one  will  always  be  outside  the  earth's  bow  shock. 
The  final  phasing  of  these  two  satellites  was  com- 
pleted on  July  20  but  station  keeping  will  be 
required  periodically. 


Fig.  4.  Schematic  overview  of  SELDAOS.  The  geostation- 
ary orbiting  spacecraft  transmit  data  directly  Into 
the  system.  The  NOAA  (ITOS)  solar  proton  data  arc 
sent  over  the  ATN  (see  text),  which  Is  part  of  the 
coaminl  cat  Ions  system.  SOLNAO  11A  and  8 (SOLNAO  HI) 
data  are  sent  over  a USN  coaMrclal  line  often  pro- 
cessing at  NNL.  TIROS  N Is  a follow-on  series  of 
satellites  which  will  replace  NOAA  satellites;  the 
first  launch  will  be  early  In  1978.  Various  aspects 
of  the  system  arc  discussed  In  the  ':cxt. 


There  arc  a number  of  ground-based  observations  which  are  fed  Into  SELOAOS.  In  fact,  over  7S  stations 
around  the  world  supply  data  of  the  following  types:  total  electron  content;  hydrogen  alpha  (6S63  I)  solar 
events,  features,  and  patrol;  discrete  frequency  solar  radio  flux  and  bursts;  spectrographic  solar  radio 
events;  solar  calcium  plage  observations;  coronal  Intensities:  white-light  sunspot  observations;  optical 
auroral  observations;  auroral  radar  backscatter;  lonosonde  observations;  high-frequency  radio  path  signal 
strengths;  sudden  Ionospheric  disturbances;  high-latitude  riometer  data;  and  magnetic  field  observations. 
Mich  of  the  ground-based  data  come  In  through  the  world-wide  lUWDS  network. 


Beginning  In  1977  and  extending  through  the  period  of  the  International  Magnetospheric  Study  (IMS), 
which  ends  December  31,  1979,  an  extensive  set  of  25  magnetometer  stations  located  In  the  Artie  and  across 
the  Americas  and  the  Pacific  will  relay  data  through  the  SMS/GOES  satellites  to  SELOADS.  These  IMS  mag- 
netometer data  files  will  be  maintained  at  the  original  10  sec  resolution  for  four  days  and  one  minute 
resolution  for  32  days  before  being  dumped  to  magnetic  tape  and  archived. 


All  Incoming  data  are  handled  by  the  computer  system  shown  In  Fig.  5.  The  data  base  Is  contained  on 
the  two  58  megabyte  disc  drives.  The  two  Nova  1200  mini-computers  are  connected  by  a multiprocessor  com- 
munications adaptor  (MCA)  and  can  transfer  data  to  and  from  each  other  at  a 75  kilobyte  rate.  The  new 
computer  will  have  floating  point  arithmetic,  two  10  megabyte  disc  drives,  and  a 70-character-per-11ne,  300- 
llnes-per-mlnute  printer.  Display  capabilities  at  the  facility  Include  Interactive  and  call  up  CRT  dis- 
plays, strip  charts,  word  messages,  and  printer  output.  Computer  (1)  In  Fig.  4 has  Input/output  Interfaces 
for  (a)  acquiring  data  from  two  SMS/GOES  satellites,  (b)  acquiring  data  from  the  USAF  ATN  line,  (c)  handling 
a 2400  baud  link  between  SELDADS  and  the  Global  Weather  Center  at  Offutt  Air  Force  Base,  (d)  Interfacing 
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with  • cowunlcatlons  line  (DIG  HIUAY)  to  the  Space  Envlronnent  Services  Center  (SESC),  which  Is  also 
located  at  NOAA,  Boulder,  (c)  handling  an  on-site  teletype,  and  (f)  handling  a remote  user  through  a 
data  phone  (DP)  terminal.  C^uter  (2)  has  similar  Interfaces  shown  In  Fig.  4 and  sends  data  to  a RAMTEK 
CRT  device  which  Is  capable  of  generating  eight  low  resolution  (256  bit  by  240  lines)  and  two  high  resolu- 
tion (512  bit  by  480  lines)  alphanumeric  and  graphic  displays.  The  final  system  will  have  three  9-track 
magnetic  tape  drives  and  one  7-track  drive.  The  SEL  Computer  Terminal  (CT)  will  allow  access  to  a high- 
speed printer,  additional  terminals,  and  the  Environmental  Research  Laboratories  CDC-6600  Computation  system. 


F1g.  5.  SE'OAOS  Computer  Systems.  Equipments  which  existed  In  1975  are  shown  as  solid  line  blocks;  dotted 
line  blocks  represent  equipments  which  are  being  procured  and  will  be  operational  In  the  fall  of  1976. 

The  line  lab  led  DIR  HIUAY  Is  a communications  link  which  connects  the  system  to  the  SESC  data  displays. 


The  required  equipment  for  a remote  user  Is  a 110  baud  (10  characters/sec  In  ASCII  Code)  acoustically 
coupled  terminal  operating  on  a full  duplex  line  and  a coupler  In  original  mode  compatible  with  a Bell  103-C 
data  set.  Normal  data  dissemination  procedures  Include  direct  computer>to-computer  links  with  the  real-time 
data  system,  user  terminals,  telephones,  teletype  reports  and  forecasts,  a weekly  summary  of  solar  terres- 
trial environment  activity  Issued  by  SESC,  and  special  requests  (Reference  5).  The  remote  user  Is 
restricted  to  a print-out  display  on  a real-time  basis,  unless  he  links  the  data  Into  his  own  computer. 

For  specific  time  periods  users  may  request  that  CRT  graphs  be  transmitted  by  telecopier  over  the  telephone. 
Of  course,  this  requires  that  the  user  have  a compatible  telecopy  receiver  compatible  with  SESC's  Xerox  4001 
equipment.  All  products  of  the  facility  are  available  through  the  mall. 

Besides  making  this  extensive  and  current  data  base  available  to  a user.  It  Is  possible  to  obtain  some 
synthesized  products.  Magnetograms  from  Individual  stations  are  useful  In  special  cases  but  the  stacked 
magnetograms  shown  In  Fig.  6 are  much  more  useful  In  understanding  the  development  of  geomagnetic  phenomena. 
For  a chain  of  stations  lying  along  a geomagnetic  meridian  (three  such  chains  will  be  operational  during 
the  IMS),  the  user  will  be  able  to  call  up  the  results  of  an  electrojet  current  model  calculation  which 
Is  consistent  with  the  observations.  At  one  specific  time  the  variation  along  the  meridian  can  be  displayed. 
Another  mode  provides  the  time  history  of  the  currents  where  the  central  latitude,  width,  and  current  Inten- 
sity for  each  electrojet  are  given. 


Much  of  the  data  available  through 
SELDAOS  was  not  previously  accessible 
until  months  after  data  acquisition.  This 
rapid  dissemination  of  space  environmental 
data  makes  new  research  more  effective  and 
provides  a rapid  understanding  of  such 
things  as  radio  blackouts,  auroral  dis- 
plays, solar  activity,  and  the  terrestrial 
effects  of  large  geomagnetic  storms,  where 
electrical  power  transmission  can  be 
Impacted. 


IV  AOIPS 

AOIPS  is  an  Interactive  system 
designed  to  provide  Image  display,  analy- 
sis, and  data  management  for  the  NASA 
Investigators  In  the  disciplines  of  mete- 
orology, hydrology,  earth  resources,  and 
oceanography.  This  digital  Image  pro- 
cessing system  has  been  developed  by  the 
Computer  Systems  Branch  of  the  Information 
Extraction  Division,  Applications  Director- 
ate, GSFC.  The  majority  of  satellite 
experiments  Involved  In  the  above  disci- 
plines produce  a series  of  line  scans  In 
one  or  more  bands  of  the  electromagnetic 
spectrum  to  provide  a data  format  similar 
to  that  used  to  produce  a TV  picture  and 
to  that  used  In  VISSR  and  SR  Instruments 
described  In  the  previous  section.  The 
more  precise  systems  such  as  AOIPS  handle 
the  data  in  a digital  fashion  rather  than 
analog.  The  resolution  of  the  resulting 
Image  depends  on  the  number  of  lines  and 
the  number  of  pixels  In  each  line  as  well 
as  the  field  of  view  of  the  Instrument. 

The  other  parameter  Is  the  number  of 
levels  of  Intensity  (shades  of  gray)  per 
pixel.  The  total  number  of  bits  gener- 
ated by  such  a satellite  Instrument  can 
be  quite  large  and  efficient  processing 
1$  extremely  Important.  AOIPS  provides 
for  Image  Information  extraction  of  512 
lines  X 512  pixels. 

The  Important  functions  which  such  a 
processing  system  must  provide  are  the 
following.  Registration  Is  Important  In 
being  able  to  superimpose  the  Images 
obtained  with  multlspectral  devices  so 
that  a composite  color  or  false  color 
photograph  can  be  obtained.  There  Is  r 
obviously  noise  or  Interference  which 
Intermittently  produces  errors  In  certain 
pixels  and  It  Is  desirable  to  correct 
these  sporadic  effects.  The  ability  to 
overlay  a grid  or  landmark  Information  Is 
necessary  to  carry  out  a proper  analysis 
of  certain  data  and  to  display  the  results 
In  a meaningful  way.  In  the  study  of  cloud 
patterns.  Ice  flows,  land  use,  soil  moisture 
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Fig.  6.  Stacked  magnetograms  during  magnetic  storms.  The  upper 
panel  contains  data  from  stations  very  near  the  magnetic  poles. 
The  middle  panel  contains  Northern  Auroral  Zone  stations  and 
low  latitude  magnetometer  data  Is  given  In  the  bottom  panel. 
This  synthesized  data  product  Is  useful  In  the  study  and  evol- 
ution of  magnetic  storms  and  sub-storms. 


and  other  similar  topics  the  ability  to  present  a time  lapse  Is  extremely  useful.  This  requires  data 
management  techniques  to  access  data  which  provide  pictures  of  the  same  scene  at  different  times  and  to 
put  on  Image  display  devices  the  appropriate  elements  to  understand  the  general  time  evolution  of  the 
desired  quantities.  Classification  of  the  ensemble  of  bits  that  comprise  a digital  picture  Is  useful  In 
order  to  reduce  the  number  of  choices  that  one  needs  to  retrieve  and  study  desired  Images,  finally,  dis- 
play of  the  resultant  selection  and  manipulation  of  the  elements  are  Important.  This  Involves  producing 
outputs  on  printers,  plotters,  TV  displays,  CRT's,  Image  recorders,  and  photographic  film. 

The  hardware  configuration  that  has  been  chosen  to  carry  out  these  extensive  tasks  Is  presented  to  show 
elements  of  a typical  system.  A more  detailed  description  of  this  system  has  been  given  by  Dalton  (Refer- 
ence 6).  A modified  General  Electric  Image  100  Interactive  Multlspectral  Image  Analysis  System  with  pro- 
visions for  a TV  scanner  which  digitizes  hard  copy  Images,  maps,  overlays,  and  other  Information  Is  shown 
In  Fig.  7.  A mere. extensive  data  management  and  display  system  which  can  link  Into  the  Image  100  system 
and  also  link  Into  the  large  IBM  360/91  GSFC  computer  through  a 4800  baud  half-duplex  b1 -synchronous  line 
Is  blocked  out  In  Fig.  8.  The  Digital  Equipment  Corporation  (DEC)  PDP-11/45  mini-computer  controls  the 
Image  100  Analyzer  Console  and  single  color  graphics  output  can  be  displayed  on  Textronix  4012  terminals. 

The  PDP-11/70  mini-computer  controls  the  GSFC  designed  Image  display  terminal.  Important  parameters  of  the 
equipment  are  given  In  the  figure  captions.  A Dicomed  recorder,  which  Is  not  shown, is  used  to  produce  both 
black  and  white  and  color  photographs  of  synthesized  products. 
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Fig.  7.  Image  100  Analysis  System.  Some  of  the  basic  parameters  of  the  elements  are:  Disk-2.4  megabyte 

capacity  with  transfer  rate  of  0.18  megabyte/sec;  tape  drives-9-track,  125  ips,  800/1600  bpi;  line 
* printer  132  columns,  1200  1pm;  4012  graphics  terminal  operated  at  1200  baud;  basic  cycle  time  of  PDP  11/45 

with  memory  management- 1 .07  psec  for  16  bit  word;  card  reader-300  cards/min. 


With  the  software  developed  for  the  system  the  following  capabilities  exist:  (1)  list  image  files, 

(2)  display  Images  on  TV,  (3)  reduce  master  tone  to  TV  size,  (4)  display  subarea  (zoom),  (5)  perform  land- 
mark register,  (6)  navigation,  (7)  cloud  tracking,  (8)  wind  output,  (9)  image  registration,  correction, 
landmarking,  (10)  build  wind  vector  overlay,  (11)  superimpose  vector  overlap  on  image,  (12)  flying  reseaus, 
(13)  looping  images,  (14)  list  wind  vectors,  (14)  film  image  output,  (15)  produce  Calcomp  plot  of  output, 
(16)  classification,  (17)  time  lapsing,  (18)  feature  computation/analysis. 

The  system  currently  supports  studies  with  the  VISSR  data  from  SMS/GOES  and  the  Multispectral  Scanner 
data  from  LANDSAT  1 & 2,  and  the  NASA  Aircraft  Program.  Some  experiments  on  Nimbus  G and  HCMM  (both  1978 
launches)  will  also  be  processed  by  AOIPS.  These  studies  encompass  forest  inventories,  dismal  swamp  signa- 
tures, power  line  routing,  census  urban  area  delineation,  watershed  survey,  water  resources  management,  soil 
moisture  analysis,  oceanographic tliermal  signatures,  and  severe  storms. 

One  synthesized  output  from  AOIPS  is  the  divergence  of  the  wind  velocity  field  at  low  altitudes  during 
Hurrican  Elaine  which  is  superimposed  on  the  clouds.  The  divergence  contours  are  shown  in  Fig.  9 in  units 
of  10'"/sec  where  H marks  positive  (inflow)  peaks  and  L marks  negative  (outflow)  peaks. 

AOIPS  has  not  been  in  existence  long  enough  to  have  standard  data  products  flowing  out.  However,  as 
these  products  are  developed  they  will  be  made  available  through  data  distribution  groups  such  as  WOAA's 
SDSB  or  NASA's  NSSDC. 


Fig.  8.  GSFC-desIgned  color  image  display  and  analysis  system.  Some  of  the  basic  parameters  of  the 
elements  are:  disks-88  megabyte  capacity  with  transfer  rate  of  0.8  megabyte/sec;  tape  drives-9-track, 

75  ips,  800/1600  bpi;  line  printer-132  columns,  300  1pm,  card  reader-300  cards/min;  teletype-10  baud; 

VT05  alphanumeric-300  baud;  VT50  alphanumeric-1200  baud;  graphics  CRT-1200  baud,  image  display  terminal- 
2 megabyte/sec;  basic  cycle  time  of  POP  11/70  with  memory  management  and  interleaving-0.69  usee  for  32 
bit  'ouble  word:  and  High  Density  Digital  Tape-14  channels,  Ih  ips. 

It  is  clear  that  exciting  and  useful  products  will  be  available  in  the  near  future. 

V IMS/Satellite  Situation  Center 

The  Satellite  Situation  Center  was  established  within  NSSDC/WDC-A  at  GSFC  to  support  the  IMS,  an  inter- 
national program  involving  some  44  countries  in  studying  the  earth's  magnetosphere  by  means  of  balloons, 
sounding  rockets,  aircraft,  ships,  satellites,,  and  ground-based  measurements  during  the  period  1976-1979. 
Although  there  are  dedicated  satellites  being  launched  by  several  countries,  including  the  core  satellites 
GEOS  (European  Space  Agency)  and  ISEE  A/B  4 C (NASA),  it  is  important  to  utilize  all  available  in-situ 
measurements  by  older  spacecraft  when  the  confluence  of  positions  provides  rare  or  unique  situations.  The 
IMS  program  has  been  discussed  recently  in  several  articles  (References  7,  8,  9,  10). 

The  main  role  of  the  SSC  is  to  study  the  predicted  orbits  of  some  50  or  more  spacecraft  which  have  oper- 
able magnetospheric  experiments  and  to  identify  interesting  configurations  of  this  ensemble  well  enough  in 
advance  so  that  the  IMS  scientific  comunity  can  plan  supporting  measurements  by  balloons,  rockets,  etc.  and 
can  insure  that  the  Important  data  from  the  older  satellites  be  acquired  during  the  periods  of  interest.  A 
more  detailed  description  of  the  SSC  and  its  activities  is  given  in  Reference  11. 

The  most  useful  satellite  orbits  to  study  for  the  above  purpose  are  high  altitude  ones,  where  apogee 
altitudes  are  at  least  12  earth  radii  (Rg).  Fortunately  most  such  satellites  have  perigees  high  enough  that 
the  atmospheric  drag  is  negligible  and  orbital  positions  can  be  predicted  accurately  6 to  12  months  in 
advance.  Based  on  the  most  recent  orbit  elements  at  the  time,  predicted  orbit  tapes  giving  geocentric 
coordinates  as  functions  of  time  are  generated  for  the  desired  Intervals  by  the  Operational  Orbit  Support 
Branch  at  6SFC  and  provided  to  the  SSC.  These  data  are- then  read  onto  a 25  megabyte  disc  to  form  the  basic 
on-line  data  ^se  for  this  work.  Software  has  been  developed  to  search  this  data  base  to  determine  which 
satellites  are  nearly  simultaneously  passing  through  regions  of  the  magnetosphere  which  are  of  primary 
Interest.  These  regions  are  the  bow  shock,  magnetopause,  magnetosheath,  magnetotail  cusp,  interplanetary, 
and  neutral  sheet.  Such  regions  can  be  defined  mathematically,  although  in  practice  the  boundaries  move 
depending  upon  the  solar  wind  and  Interplanetary  magnetic  field  parameters.  Several  coordinate  systems 
must  be  employed  to  carry  out  these  queries  and  the  geocentric  coordinates  of  the  satellites  are  converted 
to  the  appropriate  system. 

The  results  of  the  above  computations  must  be  displayed  appropriately  in  the  various  coordinate  systems. 
In  order  to  do  this  efficiently.  Interactive  graphics  terminals  (Textronix  4014)  connected  to  a Mod  Comp  IV 
computer  through  a 9600  baud  line  are  employed.  The  resultant  graphs  with  appropriate  annotation  are  photo- 
graphed from  a Textronix  611  slave  CRT  with  an  automated  35  mm  camera. 
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The  Mod  Comp  IV  com- 
puter contains  512  K bytes 
of  core  and  has  an  effec- 
tive cycle  time  of  1.4  ysec 
for  a 32  bit  double  word 
access.  The  total  system, 
which  is  used  to  support 
NSSOC/WDC-A  for  Rockets 
and  Satellites  includes: 
four  9- track  800/1600 
bpi,  125  ips  tape  drives; 
two  7-track,  556/800  bpi, 

125  ips  tape  drives;  two 
4014  graphics  terminals 
with  611  slaves;  twelve 
Textronix  4023  alphanu- 
meric terminals  operating 
at  1200  baud;  one  300  card/ 
min  reader;  one  132  column, 

96  character,  200  1pm  im- 
pact printer;  one  500  1pm 
electrostatic  printer/ 
plotter;  three  50  megabyte 
discs;  one  25  megabyte 
disc;  and  the  Mod  Comp  IV. 

One  important  display 
of  the  satellite  positions 
is  shown  in  Fig.  10  and 
illustrates  the  period  when 
Imp  H,  Hawkeye  1 and  Vela 
58  all  cross  the  magneto- 
pause within  2.2  hours  of 
each  other.  At  this  same 
time.  Imp  0 is  monitoring 
the  interplanetary  medium. 

The  bow  shock  and  magneto- 
pause boundaries  are  drawn 
automatically.  The  time 
period  and  satellite  are 
selected  and  the  orbital 
arc  is  then  drawn  by  the 
computer.  The  placement 
of  the  labels  for  each 
satellite  arc,  the  bound- 
aries, and  the  times  Fig.  9.  Divergence  of  wind  velocity  field  during  Hurricane  Elaine.  The  con- 

associated  with  the  tic  tours  are  superimposed  on  low  altitude  clouds  that  were  used  to  derive 

marks  are  selected  by  the  velocity  field.  Units  of  the  divergence  are  10’°/sec.  The  data  were  taken 

operator.  The  ecliptic  over  the  period  1330-1400  UT  on  September  12,  1974.  The  ordinate  is  geo- 
latitudes are  obtained  graphic  latitude  in  degrees  and  the  abscissa  is  east  longitude  in  degrees, 

from  other  computations 
but  are  indicated  at  the 

various  times  so  that  the  three  dimensional  information  on  the  magnetopause  boundary  crossing  is  displayed. 
Another  view  of  this  situation  for  day  104  of  1977  is  shown  in  Fig.  11.  As  stated  in  the  caption,  more 
accurate  magnetopause  and  bow  shock  crossing  times  can  be  determined  from  this  presentation.  Such  crossing 
times  are  annotated  in  the  legend  at  the  top  of  the  graph.  The  most  concise  presentation  for  this  interval 
displaying  all  pertinent  information  to  tne  required  accuracy  is  given  in  Fig.  12.  It  is  easy  to  determine 
at  a glance  the  region  of  space  occupied  by  each  of  the  four  satellites  as  a function  of  time  and  the  per- 
tinent local  time,  latitude,  and  distance  information  to  give  a three  dimensional  view  of  the  satellite 
situation. 

Prior  to  the  start  of  the  IMS,  the  SSC  studied  the  orbits  of  Imp  H S J,  Hawkeye  1,  Vela  58,  6A,  i 68 
during  1976  and  identified  18  intervals  ranging  from  13  hours  to  73  hours  in  length  when  these  satellites 
were  simultaneously  (within  2 hours)  traversing  magnetospheric  regions  which  should  provide  interesting 
measurements  to  further  our  understanding  of  the  complex  phenomena.  These  intervals  were  adopted  as  IMS 
Special  Satellite  Periods.  A publication  was  distributed  to  all  IMS  participants  showing  the  detailed 
positions  of  all  the  above  spacecraft  during  these  intervals  (Reference  12).  At  some  later  date  when  the 
actual  positions  of  the  satellites  during  1976  are  determined  accurately  from  tracking  data,  similar  plots 
to  those  shown  in  Figs.  10-12  will  be  produced  in  addition  to  other  types  of  plots  which  show  the  satellites 
relative  to  the  neutral  sheet  and  to  the  cusp  region.  In  addition,  some  important  new  satellites,  SOLRAD 
11A  & 118  and  USSR's  Prognoz  4 were  launched  in  1976  and  will  be  Included  in  the  achieved  orbit  films. 

For  1977  the  SSC  is  currently  studying  positions  of  IMP  H4J,  Hawkeye  1,  Vela  58,  SOLRAD  llA  S IIB,  the 
moon  (scientific  Instruments  left  during  Apollo  landings  are  still  operating)  and  ISEE  A/B  to  determine  the 
most  interesting  periods  for  data  collection.  When  these  studies  are  completed  , reels  of  35  mm  film  will  be 
available  to  the  IMS  community  on  request  and  two  hard  copy  reports  (one  for  each  6 months)  of  the  daily 
regionplots  shown  in  Fig.  12  will  be  distributed. 

It  is  through  this  synthesized  positional  data  that  more  meaningful  satellite  environmental  data  can 
be  obtained  and  made  available  to  interested  users. 
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I'lg.  10.  Example  of  orbit  projections  for  IMP-H,  IMP-J,  VELA  5B,  and 
Hawkeye  1 by  cylindrical  rotation  about  the  x-axis  in  the  solar  ecliptic 
coordinate  system  for  days  103  and  104,  1977.  A cursor  placed  at  any 
point  on  the  orbital  arcs  allows  the  user  to  produce  a time  tic  and  then 
the  cursor  is  used  to  place  the  time  of  this  point  (day/hour  UT)  on  the 
plot. 
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Fig.  11.  Distances  from  the  satellite  (moving  along  the  time  axis)  to  the  magneto- 
pause and  the  bow  shock  as  functions  of  UT.  Crossing  times  can  be  determined  nwre 
accurately  from  this  type  of  plot  than  that  in  Figure  1;  greater  accuracy  is  not 
necessary  since  all  boundaries  move  in  response  to  changes  in  the  interplanetary 
medium.  Positive  distances  are  on  the  concave  side  of  the  surface;  negative 
distances  are  on  the  convex  side. 
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Fig-  12.  The  bar  chart  gives  the  positions  of  all  the  satellites  in  a concentrated  format.  In  this  chart 
the  regions  occupied  by  four  satellites  during  days  103  and  104,  1977  are  shown.  Boundary  crossings  can 
be  identified  easily  with  good  accuracy,  and  latitude,  local  time,  and  distances  from  neutral  sheet  or  bow 
shock  are  given  to  locate  each  satellite  in  three  dimensions.  The  notation,  7r,  at  1600  UT  on  day  103 
means  IMP-H  is  2 earth  radii  below  the  neutral  sheet. 


V Concluding  Remarks  j 

The  purpose  of  this  paper  has  been  to  convey  techniques  and  equipment  currently  being  utilized  to  handle, 
process,  synthesize,  and  distribute  the  environmental  data  obtained  from  satellites.  All  of  the  facilities  ' i 

presented  here  except  AOIPS  are  directly  involved  with  data  dissemination.  Appropriate  addresses  have  been  ! j 

provided  so  that  any  interested  person  can  contact  the  various  facilities.  The  data  and  data  products  j 

resulting  from  environmental  satellites  form  an  important  resources  in  many  areas  of  man's  activity.  There-  i 

fore,  it  is  vital  that  the  distribution  of  this  information  be  a wide-spread  and  effective  operation.  j 
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DEVELOPMENT  AND  APPLICATIONS  OF  SPATIAL  DATA  RESOURCES 
IN  ENERGY  RELATED  ASSESSMENT  AND  PLANNING**^ 
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SUM1ARY 

Research  In  energy  related  assessment  and  planning  at  Oak  Ridge  National  Laboratory  In- 
volves Investigating  environmental  themes  at  several  levels  requiring  data  at  appropriate 
spatial  and  temporal  scales.  In  the  Environmental  Sciences  Division,  we  are  developing  a 
spatial  database  for  the  Eastern  United  States  at  the  county-subcounty  unit  level  of  resolu- 
tion. The  database  contains  Information  on  terrain,  water  resources,  climate,  land  use, 
forest  resources,  agriculture,  wildlife  resources,  critical  natural  areas,  human  population 
and  energy  uses.  Ne  have  defined  a spatial  hierarchy  of  metric,  geodetic  and  geopolitical 
scales  as  a framework  to  organizing  the  data.  Spatial  units  within  a hierarchical  level 
serve  as  building  blocks  that  can  be  assembled  or  aggregated  to  satisfy  analysis  needs. 

Building  blocks  also  allow  accessing  more  detailed  spatial  data  by  using  pointers  to  Infor- 
mation not  stored  In  the  database.  Uses  of  the  database  are  related  to  the  capability  to 
cross-reference  and  Integrate  Information  In  various  subject  sectors,  utilizing  spatial 
units  and  temporal  periods  commensurate  with  regional  themes.  An  Investigation  of  poten- 
tial changes  In  vegetation  patterns  related  to  predicted  tenperature  changes  from  Increased 
atmospheric  CO2  Is  presented  to  illustrate  an  ongoing  application  of  our  data  resources. 

Other  themes  Include  coal  extraction  In  Appalachia,  landscape  patterns,  habitat  and  popula- 
tion dynamics  of  selected  biological  species,  and  energy  facility  siting. 

INTRODUCTION 

The  production  of  ample  dean  energy  In  the  United  States  and  other  nations  of  the  world  requires 
governments  to  make  choices  concerning  the  development  of  alternative  resources  and  technologies.  Also 
related  to  the  energy  crisis  are  scientific  and  public  concerns  about  large  scale  cumulative  effects  of 
agriculture,  manufacturing,  transportation  and  urbanization  activities  on  the  environment.  The  Increased 
concern  about  environmental  quality  is  resulting  In  an  Increase  of  data  needs  and  a prollferatlo.i  of  data 
gathering.  Integrated  assessment  and  planning  requires  data  on  various  aspects  of  land,  water,  air  and 
biological  resources  available  In  compatible  formats  for  large  geographic  regions.  Our  research  Is  spon- 
sored primarily  by  the  Enerw  Research  and  Development  Administration  (ERDA)  and  is  designed  to  obtain 
coiqp rehens Ive  quantitative  information  on  the  spatial  and  temporal  distribution  of  environmental  resources 
and  to  place  these  data  Into  a standardized  spatial  database  permitting  quick  response  to  identified  needs. 
We  are  part  of  the  Environmental  Sciences  Division  at  Oak  Ridge  National  Laboratory  (ORNL)  which  has  an 
active  regional  studies  program  aimed  at  understanding  ecological  patterns  and  processes  for  integrated 
assessment  of  environmental  impacts.  ORNL  researchers  are  the  prime  users  of  our  data  resources  through 
our  collaboration  with  various  projects;  however,  we  exchange  and  share  information  with  other  groups 
having  similar  needs  and  concerns.  Our  paper  describes  how  we  organize  and  manage  large  amounts  of  di- 
verse environmental  data  to  aid  In  energy  related  assessment  and  planning. 

The  scope  and  applications  for  which  our  database  is  being  used  can  be  illustrated  by  posing  the  fol- 
lowing questions: 

• Where  are  the  optimal  areas  to  locate  energy  facilities  which  will  have  minimal  cumulative 
impact  on  the  region? 

• Are  there  natural  communities  or  agricultural  areas  located  such  that  SO2.  NO^  or  CO  emis- 
sions may  significantly  reduce  growth  or  eliminate  sensitive  species? 

• If  accelerated  burning  of  fossil  fuels  Increases  atmospheric  CO2  concentrations  enough  to 
alter  temperature  patterns,  what  changes  may  be  anticipated  In  the  distribution  of  plants 
and  animals? 

• Where  might  biological  productivity  be  great  enough  to  allow  renewable  organic  fuels,  such 
as  crop  residues  or  forestry  slash,  to  be  used  as  major  sources  of  energy? 

• Where  are  the  areas  that  possess  unique  habitat  characteristics,  e.g. , those  required  by 
rare  and  endangered  plants  and  animals?  What  Is  the  capacity  of  these  areas  to  support 
natural  populations? 

• By  comparing  areas  at  different  stages  of  urban  or  Industrial  development,  can  we  define 
the  consequences  of  urbanization  and  Industry  on  natural  plant  and  animal  conmunltles? 

As  this  partial  list  Indicates,  most  of  our  research  problems  extend  over  large  (multi-state)  areas  and 
are  concerned  with  assessing  longterm  changes  In  ecosystems  related  to  aggregated  effects  of  man's  activ- 
ities. 


^Research  sponsored  by  the  Union  Carbide  Corporation  under  contract  with  the  Energy  Research  and  Develop- 
ment Administration. 

'*'Pub11  cation  No.  901  . Environmental  Sciences  Division,  ORNL. 
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Large,  automated,  spatial  Information  systems  are  still  evolving.  They  are  often  called  SPATIAL, 
GEOGRAPHIC.  ENVIRONMENTAL  or  LAND  RESOURCE  Information  Systems.  They  a11  entail  the  manipulation  of  data 
associated  with  locations  within  large  geographic  areas.  Although  all  systems  attempt  to  represent  real 
world  patterns  and  variability,  each  system  has  Its  unique  methods  for  accomplishing  this  within  the  con- 
fines of  a digital  computer.  The  data  contents  of  these  systems  are  relatively  stable  even  though  the  uses 
of  the  data  may  be  quite  variable.  Spatial  Information  systems  support  decision-making  such  as  assessing 
Impacts,  suggesting  locations  for  specific  land  use  or  suggesting  land  uses  for  specific  locations.  These 
characteristics  are  presented  as  part  of  "Information/ Data  Handling:  A Guidebook  for  Development  of  State 
Programs"  published  this  year  by  the  U.  S.  Department  of  the  Interior. ‘ That  report,  with  supporting  docu- 
ments and  appendices,  represents  a fairly  complete  guidebook  for  developing  spatial  Information  systems 
even  though  Its  objective  was  to  aid  states  In  developing  critical  natural  area  programs. 

The  Information/Data  Handling  report  emphasizes  the  need  for  selecting  an  appropriate  spatial  scale 
to  collect,  store  and  manipulate  data  relative  to  Intended  applications.  Maps  have  been  the  traditional 
media  for  representing  spatial  Information  and  serve  to  Illustrate  the  concepts  of  scale  and  resolution. 
Small  scale  global  maps  are  valuable  In  representing  major  climatic  patterns,  blomes,  etc.  whereas  large 
scale  7-1/2  minute  quadrangle  maps  (1:24.000  - USGS)  are  very  valuable  In  representing  local  relief,  de- 
velopment and  potential  land  use  sites.  He  have  developed  several  ways  of  expressing  this  spatial  hier- 
archy of  regions  (Table  1).  The  schemes  are  based  on  units  that  are  metric  (assumes  small  areas  of  the 
earth  can  be  treated  as  planes),  geodetic  (size  of  cells  varies  depending  on  latitude)  or  geopolitical. 

The  regions  are  matched  with  map  scales  commonly  used  to  represent  each  level.  Equivalent  relationships 
were  proposed  by  Goff  et  a1.^  In  each  series,  the  area  of  the  region  Increases  exponentially  by  a factor 
of  100. 


Regular  cell  sizes  In  the  geodetic  and  metric  schemes  are  convenient  for  computer  processing,  display 
and  modeling;  however,  geopolitical  units  are  often  more  appropriate  for  data  gathering  and  socioeconomic 
considerations  as  well  as  for  action  program  Implementation.  Although  exceptions  exist,  as  a general  re- 
lationship, the  appropriate  spatial  cell  size  for  studying  regions  at  one  level  In  the  hierarchy  Is  cells 
at  the  next  lower  level.  These  relationships  of  scale  provide  a framework  for  organizing  and  relating 
spatial  data,  especially  as  regional  studies  are  broadened  to  national  and  International  scales. 


GEOECOLOGY  DATABASE 

Our  "Geoecology  Database"  Is  somewhat  unique  In  attempting  to  store  diverse  types  of  data  for  large 
geographic  areas.  Most  databases  with  broad  geographic  coverage  are  narrower  In  their  subject  content 
while  those  Including  many  subject  areas  are  limited  to  a single  state  or  smaller  area.  We  have  selected 
the  county-subcounty  as  our  spatial  unit  although  we  have  data  at  other  scales  and  formats.  Extant  data 
h.  <e  been  obtained  from  national  surveys,  national  networks  of  monitoring  stations,  digitized  maps  and 
otiie.  sources.  He  currently  have  over  500  variables  stored  In  files  with  consistent  Identifiers  and 
compatible  formats  for  counties  In  the  eastern  half  of  the  United  States  (Table  2). 

He  use  large  IBM  computers  1n  the  batch  mode  with  magnetic  tape  and  some  disk  storage.  The  Statis- 
tical Analysis  System’  developed  at  North  Carolina  State  University  Is  currently  providing  file  management 
and  statistical  analysis  capabilities.  The  Oak  Ridge  Regional  Modeling  Information  System  (ORRMIS)"  Is 
being  modified  to  handle  county  data  In  addition  to  Its  ability  to  handle  hierarchical  cell  structures, 
points,  lines  and  polygons.  We  Intend  to  use  ORRMIS  when  the  modifications  are  completed.  The  Geogra^lc 
Data  Systems  Group  at  ORNL  has  developed  numerous  computer  programs  to  analyze  and  display  spatial  data  as 
part  of  ORRMIS.  As  an  example,  to  analyze  the  effects  of  strip-mining  In  the  Appalachian  region,  that 
group  aided  In  relating  aerial  photography,  LANOSAT(ERTS) . T0P0C0M( topographic  data),  known  coal  reserves 
and  population  data  to  land  use,  aesthetic,  hydrologic  and  economic  Impacts  for  a 7-1/2  minute  quadrangle 
area.’  Computer-generated  cartographic  displays.  Including  3-D  perspective  drawings,  were  utilized  ex- 
tensively In  presenting  the  results.  The  Regional  and  Urban  Studies  Information  Center  (RUSTIC)  located 
In  the  Energy  Division  at  ORNL  maintains  a large  collection  of  computerized  socioeconomic  and  energy  re- 
lated files  which  are  available  to  us  and  to  outside  groups.'  Through  these  relationships  with  other  ORNL 

Divisions  In  data  processing,  our  project  Is  an  Integral  part  of  the  total  Information  and  analysis  activ- 

ities related  to  regional  environmental  and  energy  studies  at  ORNL.  The  next  step  of  coordination  with 
other  EROA  national  laboratories  Is  underway. 

He  are  currently  standardizing  and  documenting  the  data  files  and  assembling  software  for  our  database. 
Standardized  files  on  hand  to  date  Include  terrain,  agriculture,  land  use,  plant  and  animal  species  ranges, 
bird  surveys,  climate  and  human  population.  Files  on  water  quality  and  quantity,  air  quality,  forest  re- 
sources, critical  natural  areas  and  energy  use  are  being  processed  for  addition  to  the  database.  He  will 
be  able  to  establish  temporal  changes  with  the  database  as  we  add  data  measured  or  re-inventoried  over 
time.  Currently,  most  files  were  gathered  during  the  base  period  of  1965  to  1974.  In  addition  to  adding 
temporal  coverage,  we  anticipate  expanding  the  geographic  coverage  to  Include  the  western  states  and  to 
add  subject  files  as  required  by  research  projects.  The  system  and  data  contents  were  described  In  more 

detail  In  a paper  presented  at  the  recent  Fifth  Biennial  International  CODATA  Conference.’ 

Difficulties  encountered  In  assembling  data  from  diverse  sources  can  be  Illustrated  by  describing  the 
development  of  several  subject  sectors.  National  surveys  that  aggregate  data  to  the  county  level  present 
the  least  problems  as  source  data  for  our  database.  However,  federal  agencies  collecting  data  often  use 
different  county  Identifiers  and  different  sets  of  counties  both  of  which  must  be  resolved  as  part  of  re- 
fonnattlng  records  for  the  database.  Both  the  1969  Census  of  Agriculture  conducted  by  the  Bureau  of  the 
Census  and  the  1967  Conservation  Needs  Inventory  (CNi)  conducted  by  the  Soil  Conservation  Service  provide 
land  use  acreage  by  county  that  we  are  Incorporating  Into  our  database. 

The  CNI  used  field  surveys  to  classify  land  according  to  19  land  uses  and  according  to  conservation 
needs  (erosion  control  or  drainage)  required  to  Improve  a^lcultural  capabilities.  Unfortunately  federal 
lands,  urban  areas  and  small  lakes  were  excluded  from  the  survqy.  He  have  supplemented  the  files  with 
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•crMoat  for  thost  thrtt  classes  by  county  but  art  still  attaaiptlng  to  Idontlfy  specific  land  usa  catt- 
gorlts  of  the  federal  lands  Mbich  constitute  one-third  of  the  nation.  Naps  and  diverse  published  tables 
are  being  used  to  locate  and  describe  federal  land  according  to  the  adelnlstratlve  agency,  e.g..  National 
Park  Service^  Forest  Service.  Bureau  of  Land  Nanagewnt,  Anted  Services,  etc. 

The  Census  of  Agriculture  Is  restricted  to  fana  land  surveyed  by  mil  questional  res.  Approximtely 
m records  per  county  are  available  on  suanary  tapes  froai  the  Econoailc  Research  Service.  Ue  abstracted 
46  crop  acreage  and  yield  totals  per  county  to  represent  aiajor  crops.  Both  this  and  the  CNI  survey  have 
SM  slallar  categories  for  crops  such  as  "close  grown  crops."  "row  crops,"  "hay,"  etc.,  but  acreages  froai 
the  surveys  do  not  agree.  Differences  may  be  due  to  different  classification  schemes,  two  year  time  dif- 
ference, sampling  differences,  or  withheld  data  In  the  Census  of  Agriculture  related  to  data  privacy  con- 
siderations. Although  both  files  represent  very  useful  county  data,  the  pecularltles  associated  with  each 
must  be  documnted  and  considered  In  using  the  files. 

Developing  a climatic  subject  sector  has  presented  mny  more  difficulties  In  attempting  to  character- 
ize the  cllmte  for  each  county.  Our  desire  Is  to  store  average  seasonal  patterns  for  temperature,  precip- 
itation, moisture,  winds,  radiation  and  other  climatic  parameters.  The  sector  Is  being  created  from 
Interpolated  averages  based  on  Weather  Bureau  station  records.  Interpolated  values  from  mps  In  the  C11- 
mtlc  Atlas  of  the  U.  S.  and  derived  variables  such  as  the  Thomthwalte  evapotranspi ration  Index.  The 
Weather  Bureau's  primry  mission  Is  short-term  weather  prediction  for  populate  areas.  Consequently  the 
200-250  first  order  (most  complete  climatic  records)  stations  In  the  east  are  located  at  large  cities  or 
airports.  Temperature  and  precipitation  monthly  norms  are  available  for  the  periods  1931-1960  and  1941- 
1970  for  approximtely  1600  stations  In  the  east;  however,  regions  with  abrupt  cllmte  changes,  such  as 
near  mountains,  have  few  stations.  An  unexpected  problem  encountered  In  processing  the  station  data  was 
obtaining  a digitized  file  of  the  latitude  and  longitude  of  each  station  with  county  Identifiers.  We 
ended  up  mnually  extracting  this  Information  from  Weather  Bureau  publications. 

We  are  developing  Interpolation  algorithms  to  estimte  weather  data  for  all  counties.  To  Interpo- 
late values  for  county  centroids,  we  use  mathematical  functions  that  define  the  characteristics  of  a 
hypothetical  surface  for  the  digitized  points.  An  equation  of  a tangent  plane  for  each  digitized  point 
Is  calculated  using  the  closest  3-12  points.  Next  these  equations  for  points  surrounding  a centroid  are 
solved  to  give  values  for  each  tangent  plane  at  the  centroid;  these  values  are  averaged  to  give  the  estl- 
mted  value  for  the  county. 

Naps  entail  several  steps  In  data  preparation  prior  to  obtaining  county  values.  We  have  use^»n 
electronic  digitizing  device  to  store  coordinates  of  points  or  Isollnes  on  a mp  In  computer  rea\S'i‘ 
form.  Subsequently  these  data  must  be  Interpolated  to  counties  using  the  sam  mthods  described  ter 
station  date.  Finally  the  file  must  be  displayed  and  compared  against  the  original  mp  for  editing.  The 
resulting  Information  Is  limited  In  accuracy  relative  to  the  coifrlned  factors  of  base  mp  resolution  and 
mthemtics  of  Interpolation. 

The  cllmtic  sector  has  utilized  a variety  of  data  sources  and  analytical  tools  described  above  to 
obtain  the  Information  we  require  to  Investigate  the  Influence  of  cllmte  on  ecosystems  and  the  role  of 
cllmte  In  distributing  and  diluting  pollutants  In  the  environment.  We  anticipate,  as  demands  for  spa- 
tially distributed  data  Increase,  that  mission  oriented  agencies  such  as  the  Weather  Bureau  will  develop 
capabilities  to  produce  this  Information  from  their  files  In  digital  form.  We  also  anticipate  providing 
these  agencies  with  feedback  as  to  our  research  needs  which  could  be  met  by  addition  of  stations  or  param- 
eters being  masured.  We  acknowledge  that  there  are  deficiencies  In  our  cllmtic  sector  but  feel  that 
this  first  1teri*1on  may  provide  guidance  to  future  refinemnts. 

There  are  2660  counties  In  the  eastern  37  states  ranging  In  size  from  60  to  17,700  km*,  with  an  aver- 
age of  1750  lu>r . Som  advantages  of  the  county  as  a spatial  unit  are;  well  established  and  quite  perma- 
nent geopolitical  units;  coamon  unit  for  state  and  federal  data  collection;  uniform  In  size  In  the  east; 
often  used  as  managemnt  units  for  environmntal  policy;  boundaries  often  approximte  geophysical  features; 
and  county  outlines  are  available  In  computerized  form.  Disadvantages  Include:  occasionally  names  or 
boundaries  change;  a few  counties  are  large  or  heterogeneous;  some  models  use  grids  of  uniform  cell  size; 
and  Indapendent  cities  representing  some  urbanized  administrative  areas  are  treated  differently  relative 
to  data  collections.  We  have  addressed  the  problem  of  large  counties  by  proposing  a set  of  standard  sub- 
county units  (SCU's).  By  subdividing  79  counties  based  on  size  and  geographic  heterogeneity,  the  largest 
subcounty  unit  In  the  east  was  reduced  to  6,900  km*  and  the  average  to  1660  km*  while  the  number  of  SCU's 
Increased  to  2800.  We  are  entering  some  data  for  these  smiler  units  and  using  them  In  cartographic  dis- 
play. Counties  In  the  West  are  often  large  and  heterogeneous  and  this  limits  their  usefulness  as  spatial 
cells  unless  a common  set  of  SCU's  can  be  defined. 

Data  sunmrlzed  at  the  county  or  subcounty  scale  of  resolution  can  be  analyzed  to  Investigate  a 
variety  of  "mere"  patterns  and  processes.  The  Influence  of  cllmte  on  broad  vegetation  and  crop  zones 
can  be  effectively  described  In  terms  of  such  analysis.  Nore  direct  assessment  related  patterns  and  proc- 
esses, such  as  the  Influence  of  temperature  regim  on  the  feasibility  of  development  of  given  energy  tech- 
nology alternatives,  can  also  be  addressed  using  county  level  data  as  "building  blocks"  for  the  analysis. 

In  this  usage,  counties  can  be  viewed  as  sampling  "plots"  for  nation-wide  systems  analysis. 

The  Enej^  Division  at  ORNL  has  used  county-level  studies  to  Identify  candidate  counties  for  energy 
developmnt.’  These  studies  located  counties  based  on  demnd,  water  availability  and  other  criteria  which 
potentially  would  support  an  energy  facility.  We  anticipate  refining  these  predictions  based  on  considera- 
tions of  local  and  regional  environmental  Impact.  The  resulting  list  of  counties  would  then  require  ad- 
ditional studies  at  a finer  scale  of  resolution  prior  to  the  tim  at  which  a power  company  or  public 
utility  may  explore  tracts  potentially  available.  Local  constraints  would  narrow  the  final  selection; 
however,  this  sequence  will  result  on  establishing  data  files  for  use  In  Impact  statement  preparation. 

A vast  amount  of  Information  exists  about  our  environmnt.  The  "National  Inventory  of  Biological 
Nonitoring  Program"  being  conducted  at  ORNL  has  canvassed  over  3000  biomonitoring  and  baseline  projects 
In  the  United  States.  The  "National  Index  of  Data  Bases"  coaplled  by  Lawrence  Livermore  Laboratory 
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contains  ovar  4000  descriptions  of  databases  related  to  energy  and  environment.  The  Ecological  Sciences 
Information  Center,  the  Environmental  Response  Center  and  other  Information  centers  at  ORNL  provide  bib- 
liographic and  state-of-the-art  literature  reviews  which  contain  valuable  numeric  data  to  complement  our 
spatial  database.  Our  own  activities,  sponsored  by  both  ERDA  and  the  National  Science  Foundation  (NSF)  In 
conjunction  with  the  International  Biological  Program  (IBP),  have  generated  nearly  300  data  sets.*  These 
diverse  data  sourcas  are  Individually  and  collectively  extremely  valuable;  however,  these  data  are  often 
not  readily  available,  have  Incomplete  documentation  or  were  not  designed  to  be  compatible  with  other  re- 
lated studies.  Accumulating  these  files  Into  a single  standardized  database  would  be  costly  and,  because 
of  the  uniqueness  of  each  data  set,  scientifically  undesirable.  Figure  1 represents  the  relationships  of 
our  database  and  data  sources.  Ue  are  selecting  preferred  data  sets  from  these  diverse  sources  based  on 
geographic  coverage,  subject  content,  sampling  time  period,  sampling  techniques  and  availability  for  entry 
Into  our  database.  Even  though  we  do  not  Intend  to  enter  the  m^orlty  of  existing  environmental  data  Into 
our  files,  we  consider  these  sources  of  Information  to  be  an  Integral  part  of  the  resources  available  for 
our  research  efforts. 

The  county,  or  other  spatial  cell,  can  be  used  as  an  effective  repository  unit  for  locating  and  as- 
sembling Information  about  more  detailed  aspects  of  the  cell.  The  county  record  can  contain  Information 
about  the  Information  pertaining  to  the  county  — whether  a particular  type  of  data  is  available  within 
that  county,  how  much  data  are  available,  the  scale  of  resolution,  dates  of  data  collection,  as  well  as 
other  descriptors  of  the  data.  As  more  data  are  collected  at  refined  scales,  "pointers”  to  these  data  can 
be  added  to  the  county  records.  In  this  way  the  county  can  serve  as  a reservoir  of  pointers  to  more  re- 
fined data  and  the  county  record  can  be  utilized  as  a repository  of  Information  about  data  that  may  be 
stored  In  auxiliary  files,  retained  In  parent  agency  files,  or  potentially  available  In  other  ways.  As  an 
example,  we  Intend  to  store  the  location  and  Identifiers  of  Critical  Natural  Areas  as  administered  or  In- 
ventoried by  states.  The  Nature  Conservancy,  The  National  Park  Service,  Experimental  Ecological  Reserve 
and  others  In  our  database.  We  will  not  store  detailed  information  or  data  from  these  areas  but  will  ob- 
tain the  data  from  the  administrative  agency  if  we  identify  counties  in  which  a user  requires  the  more 
detailed  Information.  Therefore,  our  approach  to  storing  selected  county-level  data  is  an  efficient  means 
of  getting  to  the  more  detailed,  voluminous  data  that  are  not  stored  in  the  database. 


applications 

i 

Ue  are  attempting  to  use  the  hierarchical  approach  to  the  development  of  our  spatial  database  to  ad-  i 

dress  issues  of  regional,  national  and  global  significance.  An  example  of  anticipated  impacts  form  fossil 
fuel  burning  Illustrates  the  use  of  the  database  and  our  approach.  All  nations  have  been  contributing  to 
a nearly  exponential  increase  of  CO2  in  the  atmosphere  in  recent  decades.  The  nature  of  atmospheric  mixing 
and  seasonality  causes  the  worldwide  CO2  increase  to  show  quicker  increase  and  greater  seasonal  amplitude 
in  the  high  northern  latitudes  than  in  either  the  tropical  or  southern  hemispheric  regions.  This  is  due 
to  atmospheric  CO2  being  drawn  down  by  photosynthesis  during  sunner  months  on  the  broad  northern  conti- 
nents and  then  being  regenerated  by  respiration  during  the  rest  of  the  year.  About  half  of  the  amount  of 
CO2  added  from  fossil  fuel  burning  has  been  balanced  by  increased  absorption  by  the  oceans  and/or  the 
photosynthetic  plant  cover  of  the  lands.  Forests  appear  to  be  the  most  effective  of  all  ecological  sys- 
tems in  exchanging  and  storing  carbon  as  biomass.  However,  this  capacity  for  increased  absorption  is  being 
reduced  by  the  clearing  of  forests.  The  burning  and  indirect  oxidation  (decomposition  by  microorganisms) 
of  carbon  pools  formerly  associated  with  forests  are  contributing  to  an  increase  in  atmospheric  CO2  level. 

Modeling  of  the  global  carbon  balance  by  several  groups  leaves  little  doubt  of  drastic  increases, 
possibly  doubling,  of  CO2  in  the  next  century  unless  energy  technology  turns  away  from  oil,  gas  and  coal 
with  the  same  abruptness  that  we  are  turning  to  these  fossil  fuels.  There  is  some  uncertainty  about  the 
degree  of  wanning  that  could  be  expected  per  doubling  of  CO2.  with  the  possibility  that  particulate  pollu- 
tion might  either  partly  counteract  or  conceivably  exacerbate  the  changes.  But  recent  assessments  (e.g., 

Baes  et.  al.)’  leave  little  doubt  that  climate  change  will  be  significant  and  spatially  variable  over  the 
coming  decades.  Ground  Investigations  need  to  be  initiated  to  document  and  predict  shifts  in  life  and 
crop  zones  resulting  from  climatic  changes.  Systematic  monitoring  of  trends  in  forest  or  other  ecological 
cover  need  to  be  expanded  to  Include  tropical  regions  and  other  areas  where  data  are  currently  limited  in 
quantity  and  quality.  The  analysis  of  satellite  data,  with  associated  aircraft  and  ground  truth  calibra- 
tion, is  of  high  priority  for  supplementing  more  traditional  data  sources  to  aid  in  improving  our  knowledge 
of  the  carbon  cycle. 

At  present,  our  database  is  being  used  to  investigate  productivity,  decomposition  and  mineralization 
rates  in  the  eastern  deciduous  forest  to  aid  in  understanding  the  carbon  cycle.  Carbon  pools,  photosyn- 
thesis rates  and  respiration  rates  are  being  determined  for  the  vegetation.  Landscape  diversity  as  re- 
lated to  climate,  terrain  and  man's  conversion  of  natural  areas  to  agriculture  and  urban  uses,  is  also 
being  studied.  Knowledge  of  relationships  of  vegetation  and  climate  is  necessary  to  predict  temporal 
changes  in  the  spatial  patterns  of  vegetation  and  crops  under  various  projected  climatic  shifts.  Having 
terrain,  forestry,  agriculture  and  climate  variables  available  on  common  spatial  units  and  In  compatible 
formats  allows  ready  cross-reference  and  combining  of  data  from  these  various  subject  sectors. 

Using  the  established  requirements  for  a crop  (e.g..  temperature,  moisture,  soil,  and  associated  vari- 
ables for  growing  corn),  we  are  attempting  to  predict  changes  in  the  growing  zone  and  differential  produc- 
tivity within  the  growing  zone  resulting  from  temperature  changes  Induced  by  factors  such  as  increased 
atmospheric  CO2.  tounty  com  yields  can  be  adjusted  to  reflect  the  predicted  seasonal  temperature  changes 
with  some  counties  potentially  being  unable  to  produce  com  and  others  switching  from  other  crops  to  com 
production.  These  switches  In  the  spatial  pattern  of  the  "corn  belt"  along  with  other  growth  modifications 
would  be  used  to  characterize  a new  hypothetical  com  region.  The  region's  character,  using  counties  as 
spatial  cells,  will  be  based  on  the  aggregated  properties  of  the  cells.  Additional  regional  characteris- 
tics for  the  larger  area  may  emerge  that  are  not  readily  apparent  from  any  one  cell  In  the  region.  This 
com  region,  with  its  emergent  properties  and  larger  geographic  coverage,  will  then  become  a spatial  unit 
to  be  used  in  studies  at  the  national  or  global  level.  This  example  shows  how  the  hierarchical  approach 
Is  being  used  to  aggregate  spatial  units  In  an  environmentally  meaningful  way  to  Integrate  studies  con- 
ducted at  various  regional  scales. 
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DISCUSSION 

In  October  1975.  a week-long  workshop  on  energy  related  regional  analysis  and  assessment  was  held  In 
Oak  Ridge.  Tennessee,  with  representation  from  most  of  the  ERDA  National  Laboratories.  In  consideration  of 
EROA's  interest  and  role  In  development  of  such  a database,  the  following  statement  was  drafted  by  the  task 
force  on  Regional  Environmental  Data  Bases: 

Recognizing: 

(1)  That  decisions  regarding  energy  research  and  development  should  be  made  on  a 
rational  basis  with  due  regard  for  the  optimum  utilization  of  all  human  and  natural  re- 
sources. as  well  as  for  minimum  desecration  of  land  and  minimum  degradation  of  environ- 
ment; 

(2)  That  such  rational  decisions  require  a base  of  spatial  and  temporal  data  re- 
lating to  human  needs  and  conditions  and  to  the  environmental  resources  fulfilling  these 
needs ; 

(3)  That  Interagency  cooperation  and  coordination  In  such  an  Infonnatlon  base  Is 
necessary  to  avoid  duplication  of  Inventory  efforts,  to  Insure  adequate  "quick  response" 
availability  of  data  on  the  multiple  subject  sectors  that  must  be  considered  In  respon- 
sible rational  resources  development,  and  to  Insure  optimal  design  of  agency  Inventory, 
planning  and  program  Implementation  efforts; 

(4)  That  ERDA-D6ER.  through  the  National  Laboratories.  Is  uniquely  qualified  to 
perform  the  collation  and  synthesis  of  selected  data  from  the  diverse  array  of  data 
presently  available;  and  to  determine  data  needs  and  mediate  In  the  design  and  Imple- 
mentation of  future  Inventory  efforts  to  Insure  maximal  relevancy  of  data  to  energy 
development,  planning  and  assessment  needs. 

The  Task  Force  on  Regional  Environmental  Data  Bases  reconmends  that  ERDA-06ER  foster 
the  design  and  development  of  a national  spatial  data  base  for  energy  resources  planning, 
assessment,  and  analysis.  Development  of  this  data  base  should  be  undertaken  as  a co- 
operative Interlaboratory  effort  with  appropriate  division  of  responsibility  and  resources 
among  the  laboratories. 

With  respect  to  coordination  with  other  agencies,  we  recommend  a "distributed  data 
base"  conceot  and  approach  with  extensive  data  on  natural  and  human  resources  as  well 
as  environmental  conditions  residing  In  the  responsible  agency.  For  ERDA  analysis  and 
assessment,  the  database  should  be  developed  as  a spatial  data  management  system  with 
quick  response  capability,  high  exportablllty  (simple  structure),  and  an  optimal  level 
of  spatial  and  temporal  standardization,  for  "preferred  data"  to  be  assembled  and  syn- 
thesized from  the  "distributed  data  base"  of  other  agencies  and  sources. 

Although  this  set  of  recommendations  does  not  represent  official  ERDA  policy.  It  does  Indicate  the  central 
role  of  energy  related  assessment  and  planning.  Whereas  In  many  other  agencies  one  dear  theme  arising 
from  a traditional  discipline-oriented  mission  renders  consideration  of  a11  other  types  of  Information 
dearly  subordinate.  In  energy  related  assessment  and  planning,  ERDA  (possibly  along  with  the  Environ- 
mental Protection  Agency  and  the  Council  on  Environmental  Quality)  must  balance  concerns  for  a11  aspects 
of  regional  development. 

We  have  attempted  to  define  a spatial  hierarchy  as  a key  In  organizing  and  using  the  vast  amount  of 
environmental  Information  available  for  energy  related  assessment  and  planning.  These  hierarchical  rela- 
tionships can  guide  In  aggregating  data  to  larger  regions  or  In  "zooming"  Into  smaller  areas.  The  spatial 
cells  within  a level  of  the  hierarchy  represent  building  blocks  which  can  be  assembled  Into  regions  ac- 
cording to  various  needs.  Each  block  can  contain  selected  data  for  key  environmental  parameters  to  allow 
analysis  at  that  level  of  resolution.  The  blocks  can  also  contain  pointers  to  other  sources  of  Informa- 
tion not  stored  In  the  database.  This  scheme  has  allowed  us  to  build  an  environmental  database  utilizing 
counties  as  geopolitical  spatial  cells  even  though  this  level  of  resolution  does  not  contain  all  the  de- 
tailed data  for  studies  at  local  levels.  Using  county  cells,  we  are  Investigating  ecological  patterns  and 
processes  of  regions  covering  several  states  relative  to  the  Impacts  of  energy  development.  Our  confidence 
In  this  scheme  Ties  In  being  able  to  Identify  potential  areas  for  more  detailed  study  and  to  access  addi- 
tional Information  using  the  hierarchical  relationships  and  pointers. 

Construction  of  spatial  databases,  such  as  ours,  can  bring  about  substantial  Integration  and  coordina- 
tion among  agencies  and  groups  having  specific  or  general  Interest  In  the  themes  addressed  by  a database. 
Most  of  this  Integration  occurs  In  using  the  database  but  a large  part  may  also  result  from  designing  the 
database,  defining  Its  subject  content  and  assembling  the  data  files.  The  database  can  become  an  Important 
project  management  tool  by  bringing  about  more  effective  conmuni cat Ions  between  the  Individuals  with  di- 
verse backgrounds  and  Interests  Involved  In  data  acquisition  and  synthesis.  This  Is  especially  true  at 
International  groups  such  as  those  sponsored  by  NATO  cooperate  In  building  databases  to  study  global  proc- 
esses . 

We  have  not  discussed  many  of  the  technical  decisions  Involved  In  developing  a spatial  database  such 
as  georeferencing.  digitizing,  file  structures,  cartographies  and  many  others.  Neither  have  we  discussed 
the  many  "people’’  problems  associated  with  automated  systems  which  may  outweigh  the  technical  problems  In 
determining  the  success  of  such  systems.  In  light  of  the  worldwide  needs  for  regional  environmental  data. 
It  Is  Imperative  that  groups  such  as  AGARD  continue  to  participate  In  planning.  Teading,  organizing  and 
controlling  the  development  of  spatial  Information  systems. 
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AGENCIES  AND  INSTITUTIONS  COLLECTING  INFORMATION 


Figure  1.  Relation  of  data  sources  for  building  databases. 


Table  1.  Spatial  hierarchies  of  regions 
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Table  2.  Geoecology  database  contents  as  of  July  1976 

Data  set  description 

Year  and  source 

Variables 
Area  /record 

Records 

/county 

County  totals 

Agricul tura1  crop  acreage  and  production 
Livestock  and  poultry  Inventory  and  sales 
Land  use 
Land  capability 

Land  use  by  land  capability  class 
Population  by  sex  and  five  year  age  classes 
County  codes,  names,  area  and  centroids 

Station  data 

Heather  station  locations 
Normal  monthly  maximum  temperature 
Normal  monthly  minimum  temperature 
Normal  monthly  precipitation 
Normal  monthly  heating  degree  days 
Normal  monthly  average  temperature 
Evapotranspiratlon  Index 
Moisture  Index 
Breeding  bird  survey  routes 
Breeding  bird  survey  counts 

Digitized  nap  data 
potential  natural  vegetation 
Tree  species  ranges 
Maimial  species  ranges 
Land  surface  form 
Underlying  bed  rock 
Principal  soils 
Glacial  deposits 
Karst  terrain  occurrences 


1969  Ag.  Census 
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1 

1967  Cons.  Needs  Inv. 
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33 

1 

1967  Cons.  Needs  Inv. 

USA 

35 

1 

1967  Cons.  Needs  Inv. 
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28 

12 

1970  Census 

USA 

45 

1 

1970  FIPS 
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14 

1 

1970  Weather  Bureau 

East 

10 
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1931-60  Weather  Bureau 
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19 

.1 

1931-60  Weather  Bureau 

East 

19 

.1 

1931-60  Weather  Bureau 

East 

19 

.9 

1931-60  Weather  Bureau 

East 

19 

.6 

1931-60  Weather  Bureau 

East 

19 

.6 

Thornthwalte  Equation 

East 

19 

1 

Thornthwalte  Equation 
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19 

1 

1966-74  Fish  A Wildlife 

East 

25 

.5 

1966-74  Fish  A Wildlife 

USA 

25 

30.6 

1969  KUchler's  Map 

East 

8 
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1971  Little's  Atlas 
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6 
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Various  sources 

S.E. 

6 
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23 

1 

1970  National  Atlas 
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1 
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Evaluated  Numerical  Data  for  the  SST  and  Chlorofluorocarbon  Problems:  ' 


A case  study  of  how  to  help  the  engineer  and  the  modellers 


David  Garvin  and  Robert  F.  Hampson 
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National  Bureau  of  Standards 
Washington,  DC  20234 


SUMMARY 

Mathematical  simulations  (models)  of  the  stratosphere  have  been  a primary  tool 
in  the  analysis  of  potential  pollution  of  the  stratosphere  from  high  flying  aircraft, 
rockets  and  the  release  of  organic  chlorine  compounds  at  the  surface  of  the  earth. 

These  models  require  large  amounts  of  numerical  data  about  the  meteorology  and 
chemistry.  The  chemical  data  comes  from  laboratory  measurements  and  must  be  in- 
terpreted and  made  available  to  the  user  community  in  an  understandable  form  and, 
preferably,  as  recommended  values. 

How  chemical  data  were  supplied  to  the  Climatic  Impact  Assessment  Program  of  the 
U.S.  Department  of  Transportation  is  described  here  with  emphasis  on  the  role  played 
by  the  Chemical  Kinetics  Information  Center  of  the  National  Bureau  of  Standards. 

This  included  planning,  identification  of  needed  measurements  and  available  measure- 
ments, determination  of  the  needs  of  users,  evaluation  of  data,  interpretation  of 
results  for  non- specialists  and  distribution  of  tables  of  rate  data.  This  type  of 
role  is  suitable  for  an  information  analysis  center  in  any  large  scale  interdisciplinary 
program. 

1.  Introduction 

This  paper  recounts  the  roles  a scientific  discipline  oriented  information 
analysis  center  has  played  in  a multi- technology  program  which  had  a clearly  defined 
applied  mission.  These  roles  are:  advisor  to  the  program  managers,  coordinator  of 

experimental  studies,  evaluator  of  data  and  communicator  with  the  user  groups. 

Similar  roles  are  appropriate  for  other  data  centers,  they  are  desirable  for  effi- 
cient program  management  and  can  be  of  great  benefit  to  the  centers. 

The  center  is  the  Chemical  Kinetics  Information  Center  (CKIC)  at  the  U.S. 

National  Bureau  of  Standards  (NBS) . The  Program  with  which  it  has  interacted  is  the 
Climatic  Impact  Assessment  Program  (CIAP)  of  the  U.S.  Department  of  Transportation. 

These  are  described  below,  briefly,  to  provide  a background  for  discussion  of  the 
interaction. 

1.1  The  Center.  The  Chemical  Kinetics  Information  Center  is  part  of  the  National 
Standard  Reference  Data  System  which  is  managed  by  the  Office  of  Standard  Reference 
Data  at  NBS.  The  Center  was  established  in  1963  and  is  located  in  the  NBS  Physical 
Chemistry  Division.  The  location  is  important  because  the  Physical  Chemistry  Divi- 
sion has  extensive  experimental  programs  in  gas  phase  reaction  kinetics,  photo- 
chemistry and  ultraviolet  spectroscopy.  Thus,  expertise  is  readily  accessible  for 
most  problems  faced  by  the  Center. 

The  Chemical  Kinetics  Information  Center  collects,  indexes  and  abstracts  pub- 
lished papers  and  reports  on  measurements  of  rates  of  chemical  reactions.  It  pub- 
lishes tables  of  data  and  bibliographies  and  also  responds  to  requests  for  informa- 
tion. To  date  it  has  accumulated  a data  base  of  some  26,000  papers,  mostly  in 
machine  readable  form.  This  data  base  is  the  raw  material  used  for  its  other  opera- 
tions. To  be  useful  for  problems  of  current  interest  it  is  mandatory  that  the  data 
base  be  kept  up  to  date.  This  combination  of  technical  expertise  plus  publication 
and  distribution  capabilities  is  ideal  for  supporting  an  applied  problem. 

1.2  The  Program.  The  Climatic  Impact  Assessment  Program  (CIAP)  was  established  in 
1970-1971  by  direction  of  the  U.S.  Congress  and  was  charged  with  determining  the 
potential  ecological  impact  of  propulsion  effluents  from  fleets  of  supersonic  trans- 
ports flying  in  the  stratosphere.  The  task  of  CIAP  was  to  reach  scientific  con- 
clusions that  could  be  used  for  establishing  technical  and  operational  standards  for 
future  air  travel  that  would  assure  a chosen  level  of  atmospheric  quality.  For  its 
work  the  program  drew  on  ten  U.S.  federal  departments  and  agencies  and  seven  governmental 
organizations  in  other  countries.  Some  one  'housand  scientists  and  engineers  contributed 
to  the  technical  outputs. 

It  was  necessary  to  collect  data  on  and  solve  stratospheric  problems  involving 
chemistry,  trace  constituent  concentrations,  meteorology,  radiation  flux,  engine 
emissions,  biological  effects  of  ultraviolet  light  (on  plants,  organisms  and  man), 
climatic  changes  (and  their  effects  on  crop  production),  and  economic  projections  of 
likely  SST  fleet  sizes  [1].  The  results  of  these  studies  were  published  in  a 
"Report  of  Findings"  [2],  six  technical  monographs  [3]  and  the  proceedings  of  four 


interdisciplinary  conferences  [4] , as  well  as  many  papers  published  separately  by  the 
contributors. 

In  addition  to  laboratory  experiments,  atmospheric  measurements,  field  tests  and 
modelling,  a program  as  diverse  as  CIAP  requires  a substantial  organization  to  handle 
coordination  of  the  work,  publications  and  decisions  about  what  to  do.  Interpretation 
of  technical  results  is  needed  by  persons  at  all  levels  of  this  hierarchy. 

As  CIAP  came  to  an  end,  another  problem  in  stratospheric  chemistry  arose.  This 
is  the  effect  of  chlorocarbon  emissions  on  the  ozone  layer.  The  programmatic  organi- 
zation within  the  U.S.  to  study  this  problem  is  more  fragmented  than  for  the  case  of 
the  SST.  The  models  are  slightly  different  because  the  source  is  at  ground  level, 
but  the  technical  requirements  for  chemical  data  remain  the  same.  The  only  change  is 
that  an  additional  set  of  chemical  reactions  must  be  incorporated  in  the  models.  The 
descriptions  of  data  needs  and  information  analysis  center  actions  that  follow  empha- 
size CIAP  but  are  applicable  to  the  new  problem  as  well. 

2 . Stratospheric  Chemistry  and  Data  Needs 

The  stratosphere  is  a region  of  intense  chemical  activity.  For  the  present 
discussion  the  details  are  not  important.  It  is  sufficient  to  note  that  the  chem- 
istry involves  50  or  so  molecules  and  free  radicals,  present  at  the  part  per  million 
concentrations  level  or  less,  that  can  interact  in  150  to  200  elementary  chemical 
reactions  and  photodissociation  processs.  The  principal  interest  has  been  in  how 
these  reactions  control  the  amount  of  ozone  in  the  stratosphere,  thereby  determining 
the  intensity  of  ultraviolet  light  that  reaches  the  earth's  surface. 

In  order  to  study  this  problem  mathematical  simulations  (models)  are  used.  They 
combine  the  chemistry  and  meteorology  in  a prediction  of  concentration  levels  of 
trace  substances  as  a function  of  geographic  position  and  altitude. 

The  models  require  large  amounts  of  numerical  input  data.  The  results  of  the 
modelling  can  be  sensitive  to  the  chemistry  included  and  the  numerical  parameters 
concerning  them.  Because  important  decisions  may  be  based  on  the  outputs  of  the 
models  it  is  necessary  to  have  high  quality  input  data. 

For  simple  chemical  models  (minimal  meteorology)  the  following  types  of  data  are 
needed: 

(1)  The  chemical  mechanism,  i.e.,  the  specific  chemical  and  photochemical 
processes  that  should  be  included  to  make  the  outputs  meaningful. 

(2)  The  rate  constant  (as  a function  of  temperature)  for  each  elementary  chemical 
reaction  in  the  mechanism. 

(3)  The  optical  absorption  cross  section  for  each  molecule  that  absorbs  visible 
or  ultraviolet  light. 

(4)  The  efficiency  of  photodissociation  (quantum  yield)  for  each  molecule 
that  can  be  decomposed  by  visible  or  ultraviolet  light. 

(5)  The  intensity  of  solar  radiation  as  a function  of  altitude,  geographic  position, 
time  of  day  and  season. 

(6)  The  temperature  and  pressure  regimes  under  which  the  chemical  processes  occur. 

Our  concern  in  the  remainder  of  this  paper  will  be  with  data  derived  from  labora- 
tory measurements,  items  (1)  through  (4),  above,  who  supplies  them,  who  uses  them  and 
how  a data  center  can  organize  this  transfer  of  information. 

r . Measurer:;  and  Users  of  Data 

The  participants  in  a program  can  be  classed  either  as  measurers  (suppliers  of  data) 
or  users.  Some  persons  are  members  of  both  groups,  but  there  is  rarely  any  doubt  about 
their  function  at  a particular  time. 

3.1  Measurers.  The  measurement  community  for  atmospheric  chemistry  consists  of  many  small 
research  groups  (two  to  six  persons  each)  in  university,  government  and  industrial  lab- 
oratories. Their  members  are  predominately  chemical  kineticists  and  photochemists.  They 
may  be  identified  easily;  they  publish  scientific  papers  devoted  to  explaining  fairly 
simple  physical  and  chemical  systems.  They  produce  data  of  the  first  four  types  listed 

in  the  previous  section  and  also  develop  methods  of  measurement.  For  CIAP  there  were 
chemical  research  groups  in  over  100  institutions. 

The  information  analysis  center  is  part  of  this  community.  It  is  relatively  easy 
to  establish  rapport  with  the  pertinent  research  groups,  and  it  is  important  to  cultivate 
them:  they  provide  the  data  retailed  by  the  center  to  the  users. 

3.2  Users.  The  user  community  is  more  diverse  but  may  be  smaller  and  more  centralized 
than  the  measurers.  One  major  task  of  a center  is  to  find  and  to  keep  in  contact  with 
this  community. 


The  users  of  cheaical  data  for  modelling  of  the  stratosphere  fall  into  several  classes 
as  described  below.  Probably  there  will  be  analogous  classes  for  any  applied  program. 


(a)  Modellers.  In  the  CIAP  orogram  there  were  at  least  10  independent  groups 
making  mathematical  simulations  oi  .he  stratosphere.  Such  groups  include  meteorologists, 
mathematicians,  computer  specialists  and  often,  but  not  always,  chemists.  Their  concern 
is  with  the  dynamics  of  the  atmosphere.  They  should  not  have  imposed  upon  them  the 
burden  of  selecting  chemical  kinetics  data.  These  groups  are  the  classical  primary 
users  in  a mission  oriented  program. 


(b)  Combustion  engineers.  This  group  is  composed  of  persons  concerned  with  engine 
tests  and  with  the  short  time  scale  dispersal  of  effluents.  Often  they  model  their  ex- 
periments and  usually  they  use  chemical  inputs,  although  these  may  be  less  extensive  than 
for  other  classes  of  users.  The  size  of  this  class  is  unknown,  but  it  was  not  large. 


(c)  Chemists.  A surprising,  major  class  of  users  of  kinetics  data  is  composed  of 
the  chemists  making  laboratory  measurements,  i.e.,  the  measurement  community.  These 
persons  must  be  considered  in  the  planning  of  an  applied  program,  they  are  heavy  users 
of  data. 


Data  are  used  by  chemists  in  several  ways.  One  is  in  the  design  of  experiments: 
the  measuring  techniques  must  match  the  expected  time  scale  and  the  concentration  levels. 
Another  is  in  interpretation  of  results  and  reduction  of  data.  Some  overall  chemical 
change  is  measured  in  an  experiment.  Very  often  five  or  more  elementary  reactions  com- 
bine to  produce  this  change.  The  rates  of  the  elementary  steps  are  wanted  as  input  for 
the  models.  But  usually  one  can  extract  information  on  only  one  or  two  of  these  from  an 
experiment.  The  rate  constants  remaining  reactions  are  taken  as  known.  Thus,  tabulated 
data  can  provide  standardized  base  points.  A third  application  is  qualitative.  The  data 
are  used  in  developing  explanations  (mechanisms)  of  atmospheric  behavior.  Tabular  data 
can  indicate  which  processes  could  or  could  not  be  important.  This  is  a screening 
function. 


(d)  Regulatory  agencies.  There  is  a small  class  of  persons  in  regulatory  agencies, 
ad  hoc  study  groups  and  international  committees.  They  need  to  understand  the  chemistry 
and  predictions  based  on  it,  want  information  that  can  be  used  in  developing  methods  of 
monitoring  or  use  it  in  setting  regulations  or  standards. 


(e)  Other  applications.  Some  technologists  working  on  problems  far  removed  from 
stratospheric  chemistry  found  the  chemical  data  to  be  useful.  Such  spin-off  may  never 
benefit  the  program  for  which  the  data  were  developed,  but  it  is  a source  of  ideas  for 
future  information  analysis  center  activities. 


Interactions  between  a program  and  an  information  analysis  center 


Many  steps  must  be  taken  before  it  is  possible  to  provide  modellers  with  reliable 
data.  Much  more  than  information  transfer  is  necessary.  The  needs  of  the  users  must  be 
determined  and  matched  with  existing  data  or  possible  experiments.  Laboratory  measurements 
must  be  made.  They  must  be  assessed  for  validity  and  selections  made  among  discordant 
data.  Then  the  material  must  be  put  into  a form  acceptable  to  the  users.  Finally  it 
must  get  to  them  and  be  used. 


The  role  of  the  Chemical  Kinetics  Information  Center  in  this  process  is  described 
below  as  an  example.  That  role  developed  by  trial  and  error.  Interaction  with  the  users 
and  with  the  management  of  CIAP  shaped  it.  Both  groups  pointed  out  important  things  that 
should  and  could  be  done. 


Four  interactions  are  identified  and  described  below.  They  occurred  throughout  the 
course  to  the  program.  A chronological  account  is  not  given  except  where  it  helps  explain 
a particular  interaction. 


4.1  The  center's  role  as  program  advisor.  Early  in  the  program  the  CIAP  managers  de- 
termined  that  a detailed  study  of  stratospheric  chemistry  would  be  necessary.  Th»y  then 
asked  the  following  basic  questions: 


(a)  What  chemistry  is  important  and  what  can  be  ignored? 


(b)  What  is  known  about  these  chemical  reactions;  for  which  are  there  usable  data 
and  for  which  are  measurements  needed? 


Which  laboratories  have  the  capability  for  making  the  measurements? 


These  questions  were  answered  by  kinetlcists  and  photochemists  in  the  NBS  Physical 
Chemistry  Division,  using  the  data  base  accumulated  by  the  Kinetics  Center.  The 
questions  were  dlfHcult,  had  to  be  answered  quickly  and  specifically.  Many  guesses 
had  to  be  made. 


Later,  as  new  technical  ideas  were  put  forward  and  new  measurements  made,  program 
managers  turned  to  the  Center  for  interpretation  and  assessment  of  impact.  Editors 
sought  answers  to  questions  about  nomenclature,  units  and  writing  chemical  reactions. 


This  advisory  role  did  much  to  establish  the  usefulness  of  the  Kinetics  Center  in 
the  eyes  of  the  program  managers.  It  is  a service  that  information  analysis  centers  can 
and  should  offer  to  applied  programs.  Not  only  does  it  pay  off  but  it  keeps  a center  on 
top  of  current  problems. 


4.2  The  role  as  coordinator.  The  technical  output  of  CIAP  was  organized  around  a series 
of  six  monographs  [3],  the  first  of  which  is  on  the  natural  stratosphere  and  contains  a 
chapter  on  the  chemistry.  This  was  written  by  a panel  of  chemists,  mostly  kineticists, 
including  two  members  of  the  Center.  The  first  draft  was  written  at  a week-long  meeting 
in  November  1972.  During  this  meeting  the  panel  also  prepared  a list  of  chemical  reactions 
to  be  considered  in  modelling  the  stratosphere  and  a table  of  rate  constants  and  photo- 
chemical data.  Another  meeting  was  held  a year  later.  Two  major  revisions  of  the  chapter 
were  made  prior  to  publication. 

The  Chemical  Kinetics  Information  Center  provided  technical  and  editorial  support 
for  this  Chemistry  Panel.  For  the  initial  meeting  it  supplied  a library  of  papers  on 
kinetics.  For  each  of  the  drafts  it  coordinated,  edited  and  occasionally  retyped  the 
contributions  of  the  panel  members.  It  prepared  the  bibliography.  The  Center  took  charge 
of  the  tables  of  rate  data  set  up  by  the  Panel,  revising  and  expanding  them  throughout 
the  course  of  the  program  (see  section  4.3). 


This  type  of  coordination  is  ideal  for  an  information  analysis  center.  It  is  an 
interaction  with  its  home  community  - the  data  generators.  The  Center  provided  this 
measurement  community  with  information  on  the  state  of  the  art,  the  best  data  to  use  and 
lists  of  measurements  that  were  needed.  In  return  it  received  a steady  flow  of  reports 
on  new  work.  For  the  CIAP  Chemistry  Panel  the  Center  did  all  those  jobs  that  annoy 
authors  and  at  the  same  time  attempted  to  assure  a measure  of  uniformity  and  coherence 
in  the  final  publication. 

4.3  The  role  as  data  evaluator.  Before  data  are  used  in  an  applied  problem  the  wrinkles 
in  the  laboratory  results  need  to  be  smoothed  out.  The  user  wants  a number,  preferably 
with  some  statement  of  its  reliability,  not  a group  of  numbers  from  which  to  select.  Very 
often  a user  is  not  an  expert  in  the  discipline  that  supplied  the  data  and  should  not 
attempt  to  interpret  the  primary  research  reports. 

Codification  and  evaluation  of  data  are  the  prime  functions  of  an  informati^  anal- 
ysis center  and  are  the  jobs  it  understands  best.  They  should  be  central  to  the  role  the 
Center  plays  in  any  applied  program. 

Data  evaluation  is  much  more  than  simply  producing  a set  of  reliable  numbers.  It 
includes  an  analysis  of  the  demands  of  users  in  order  to  determine  their  actual  needs  and 
how  these  mesh  with  the  available  data.  It  also  requires  careful  attention  to  how  the 
associated  applied  program  evolves  in  order  to  keep  the  evaluation  work  in  line  with 
changing  emphases.  And  above  all,  the  evaluators  must  know  about  the  latest  experimental 
work.  New  data  must  be  obtained,  analyzed  and  distributed  quickly.  Otherwise  the  more 
knowledgeable  users  will  lose  faith  in  the  recommended  numbers. 


The  needs  of  CIAP  for  chemical  data  were  extensive  and  varied.  They  were  met  in 
several  ways.  First,  a survey  was  made  of  papers  on  stratospheric  chemistry  to  determine 
which  reactions  were  important  and  what  new  measurements  were  needed  either  to  improve 
the  accuracy  or  to  fill  gaps  in  the  data  base.  Second,  a workshop  for  chemical  kinet- 
icists and  atmospheric  modellers  was  held  at  NBS  in  the  Fall  of  1972.  This  provided  a 
forum  for  modellers  to  describe  their  needs  and  experimentalists  to  outline  measurements 
that  could  be  made  in  the  near  future.  Third,  as  indicated  in  the  previous  section,  an 
initial  table  of  rate  data  was  developed  by  the  CIAP  Chemistry  Panel.  This  got  the 
modellers  off  the  ground.  Fourth,  a data  evaluation  program  was  mounted  at  NBS  to  ex- 
tend this  initial  effort  and  keep  it  up  to  date.  It  involved  scientists  both  at  NBS  and 
in  other  laboratories,  the  results  being  compiled  and  distributed  by  the  Center.  Fifth, 
this  evaluation  activity  was  supplemented  by  compilations  of  new  data,  not  yet  evaluated. 


All  of  this  material  was  distributed  by  the  Center  in  a series  of  reports  [5].  The 
final  tables  were  published  in  CIAP  Monograph  No.  1 [3]  and  made  available  to  the  general 
public  as  NBS  Technical  Note  866.  Talks  were  also  given  at  CIAP  Conferences  [4]  in 
order  to  keep  users  aware  of  new  developments.  The  communications  component  of  evalua- 
tion work  is  surprisingly  large. 


4.4  The  role  as  communicator.  It  is  not  enough  to  evaluate  data.  The  recommended  values 
must  reach  the  users  and  reach  them  in  time  to  be  of  use.  It  may  be  necessary  to  persuade 
them  to  use  the  data.  It  also  may  be  desirable  to  check  on  how  the  data  actually  are 
used.  This  means  an  active  communications  plan. 

The  plan  for  CIAP  was  very  simple.  It  was  to  get  the  tables  of  data  to  anybody  who 
had  a use  for  them.  There  was  no  preset  distribution  list,  there  were  no  limitations. 

Any  organization  that  could  be  Identified  as  having  an  interest  in  atmospheric  chemistry 
was  put  on  the  distribution  list.  Published  papers  and  attendance  lists  of  meetings 
were  useful  sources.  The  CIAP  newsletter  advertised  the  tables.  Requests  for  tables 
were  honored  without  question.  Over  the  course  of  the  program  the  distribution  list  grew 
from  250  to  850  persons. 

This  policy  of  saturating  the  market  paid  off.  First,  most  of  the  modellers  began 
to  use  the  same  input  data  with  the  result  that  their  results  could  be  more  easily  inter- 
compared.  Second,  the  laboratory  scientists  began  to  see  which  data  were  well  established 


and  which  they  should  measure.  Third,  a large  number  of  people  not  formally  part  of 
CIAP  became  aware  of  that  program.  Fourth,  a feedback  of  ideas  from  all  sources  en- 
riched the  studies.  And  finally,  it  established  the  Kinetics  Center  as  the  source  to 
query  when  new  data  were  needed. 

In  retrospect,  the  feedback  was  an  important  factor  in  keeping  the  Center  on  top 
of  its  job.  Modellers  indicated  needs  for  new  data  and  told  what  effects  certain  re- 
action rates  had  on  their  predictions.  These  facts  indicated  directions  for  new  work. 

Laboratory  scientists  did  the  same,  contributed  their  measurements  in  advance  of  publi- 
cation and  criticized  the  recommended  values.  This  made  the  tables  current  and  provided 
quality  control. 

Another  communications  role  was  an  attempt  at  quality  control  of  the  models.  This 
consisted  of  monitoring  the  chemical  data  used  by  the  modellers.  Each  group  was  encouraged 
to  send  lists  of  its  input  data  to  the  Kinetics  Center.  These  lists  were  checked  for 
currency  and  reasonableness.  Errors  were  corrected  and  suggestions  were  made  about  the 
chemistry  and  about  the  reliability  of  the  numbers  being  used.  To  all  appearances  this 
service  was  appreciated  by  the  groups  that  took  advantage  of  it.  From  the  viewpoint  of 
the  program  the  service  improved  consistency  among  the  models  without  forcing  it  by  fiat. 

5.  Concluding  Remarks 

An  information  analysis  center  has  two  things  to  offer  to  a mission-oriented  program. 

They  are  technical  expertise  in  a scientific  discipline  and  experience-  in  information 

transfer.  The  first  enables  it  to  coordinate  and  i.iterpret  the  laboratory  measurements 

used  in  the  solution  to  the  problem.  Also,  an  existing  Center  with  extensive  data  files 

can  respond  quickly  in  the  early  stages  of  a program  when  important  decisions  will  be 

made.  The  second  simplifies  the  service  roles  of  communicator  and  advisor:  techniques 

used  in  the  past  can  be  brought  to  bear  on  a new  problem,  they  need  not  be  reinvented.  i 

The  four  roles  of  advisor,  coordinator,  evaluator  and  communicator,  described  here, 
are  not  specific  either  to  stratospheric  chemistry  or  to  chemical  kinetics.  They  should 
be  applicable  to  most  applied  problems  that  require  substantial  scientific  input.  Any 
information  analysis  center  should  recognize  ways  it  can  play  these  roles  and  use  them 
to  make  it  a focal  point  in  a mission-oriented  program. 
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SUMMARY 

The  advent  of  computer  technology  has  resulted  In  the  ever-increasing  need  to 
collect,  process  and  store  digital  Information.  Until  recently  conventional  storage 
techniques,  through  evolutionary  product  Improvements,  have  been  able  to  keep  pace 
with  this  growth.  However,  a point  has  been  reached  where  further  Improvement  Is 
becoming  technically  and  economically  Impractical.  The  United  States  Air  Force  has 
recognized  the  limitation  of  conventional  approaches  and  has  actively  pursued  develop- 
ment of  alternate  storage  techniques.  As  an  outgrowth  of  this  commitment,  the  Rome 
Air  Development  Center  has  developed  two  mass  data  storage  and  retrieval  systems  which 
will  have  a significant  Impact  on  present  data  processing  operations.  These  systems 
have  been  designated  Human  Readable  Machine  Readable  (HRMR)  I^lcrofllm  Mass  Memory 
System  and  Wideband  Holographic  Recorder/Reproducer . 

As  world  events  become  more  Intimately  related  to  our  nation's  security.  It 
becomes  more  Important  to  maintain  large  reserves  of  reference  Information  to  accu- 
rately and  rapidly  assess  and  respond  to  crisis  situations.  From  all  available 
sources.  Information  Is  collected  and  processed  by  a network  of  sophisticated  data 
handling  systems  In  support  of  Intelligence,  command  and  control,  logistics  and  other 
operational  requirements.  In  each  application.  Information  storage  and  retrieval 
comprises  the  Important  resource,  with  computer  systems  serving  to  process  and  provide 
accurate,  usable  Information  In  the  right  place  at  the  right  time.  Such  applications 
have  stimulated  a demand  for  larger,  cheaper,  and  faster  data  storage  and  retrieval 
capabilities. 

To  meet  this  need  a number  of  manufacturers  have  stepped  up  efforts  to  produce 
new  mass  storage  systems.  In  fact,  recent  studies  have  projected  the  market  potential 
for  such  systems  to  be  between  $500  million  and  $1  billion.  As  a result  of  the  In- 
creased R&D  activity,  several  advances  have  taken  place;  the  most  dreusatlc  of  which 
have  been  In  electro-optic  data  storage. 

Communications  over  long  distances  have  been  greatly  Improved  by  the  ability  of 
satellites  to  rapidly  transmit  gigabits  of  data  between  widely  separated  points. 
Satellites  have  also  gained  In  popularity  as  a means  of  monitoring  and  managing  our 
dwindling  natural  resources.  For  example,  the  Landsat  satellite,  which  now  orbits  the 
earth,  sends  back  dally  Information  on  waterflow,  agriculture,  geography,  land  use, 
fishing  grounds  and  mineral  resources.  Repeat  observations  of  the  same  geographic 
area  are  normally  scheduled  days  apart.  Also,  analysts  and  scientists  require  com- 
parative data  covering  many  weeks,  months,  or  years.  Consequently,  ground  data 
stations  must  possess  the  capability  for  reliably  storing  and  retrieving  large  volumes 
of  analog  and  digital  data  over  long  periods.  Capture  and  retention  of  this  high 
volume,  highly  volatile  Information  has  been  made  possible  laigely  through  the  develop- 
ment of  advanced  wideband  recording  methods. 

Today,  magnetic  tape  Is  commonly  employed  to  satisfy  these  large  data  storage  and 
retrieval  requirements.  Many  automatic  data  processing  centers  within  the  Government 
have  libraries  containing  over  20,000  reels  of  magnetic  tape.  Using  this  storage 
medium,  data  can  be  read  processed  and  written  using  standard  computer  equipment. 
Magnetic  tape,  however,  does  have  two  significant  problems  that  limits  Its  further 
application.  First,  It  Is  an  expensive  storage  medium,  particularly  In  the  context  of 
large  data  storage  applications.  A reel  of  tape  typically  costs  a minimum  of  ten  (10) 
dollars,  while  Instrument-quality  tape  can  exceed  one-hundred  (100)  dollars.  There  Is 
also  the  expense  associated  with  maintaining  the  Integrity  of  the  tape  library.  Due 
to  the  tape's  erasable  nature,  stored  digital  data  will  degrade  through  use  and  storage. 
Magnetic  tape  Is  also  highly  susceptible  to  spurious  electric  and  magnetic  fields 
which  are  especially  severe  In  most  military  environments.  This  makes  tape  vulnerable 
to  both  data  loss  and  alteration  through  either  accidental  or  Intentional  tampering. 

The  Rome  Air  Development  Center  (USAF)  has  recognized  the  problems  of  utilizing 
magnetic  tape  for  a number  of  years,  and  an  alternative  mass  data  storage  concept, 
using  an  optical  recording  technique,  has  been  developed.  This  approach  has  been 
designated  the  Human  Readable  Machine  Readable  (HRMR)  Microfilm  Mass  Memory  System. 

The  system  Is  capable  of  recording  In  near  real-time  both  human  readable  Information 
and  Its  total  digital  equivalent  on  a common  film  format.  The  significant  feature  of 
this  concept  Is  that  the  machine  readable  data  Is  stored  In  an  unused  part  of  the  film 
chip  In  holographic  form. 


One  of  the  reasons  for  selecting  holography  is  Its  high  Information  storage 
capacity.  In-depth  analyses  and  experimental  breadboard  models  have  demonstrated  a 
capability  for  reliably  storing  and  retrieving  digital  data  at  a conservation  density 
of  250  thousand  bits  per  square  centimeter.  Compared  to  conventional  external  computer 
memory  storage,  a significant  reduction  In  storage  can  be  achieved  through  holographic 
recording  techniques. 

In  an  effort  to  validate  the  concept  of  holographic  data  storage,  a system  was 
designed,  fabricated  and  delivered  to  RADC  In  May  1973-  The  equipment  shown  In 
Figure  1 has  been  actively  used  for  In-house  and  contractual  experimentation.  The 
system  consists  of  the  following  modules:  (1)  microfiche  recorder;  (2)  subsystem 
controller/computer  (PDP-11/15);  and  (3)  microfiche  reader.  Input  devices  Include 
magnetic  tape,  document  digitizer  and  modem  coupler.  Output  from  the  system  Is  In 
the  form  of  HRMR  microfiche,  hardcopy  from  the  CRT  display,  magnetic  tape  or  digital 
signal  to  a host  computer  via  a modem  unit.  System  access  and  control  Is  through  the 
CRT  display  terminal. 

The  HRMR  system  Is  capable  of  producing  any  of  three  microfiche  formats  as 
depicted  In  Figure  2.  The  first  Is  an  alphanumeric  microfiche  containing  both  human 
readable  microimages  and  their  equivalent  In  binary  form.  To  create  this  particular 
format,  ASCII  formatted  data  Is  used.  Hardcopy  documents  can  also  be  stored  In  the 
form  of  a graphic  arts  microfiche.  This  Is  achieved  by  raster  scanning  each  page  Into 
a digital  bit  stream  using  a document  digitizer.  Data  compression  techniques  are 
applied  to  this  Information  to  reduce  the  data  storage  requirements.  The  third  format, 
and  the  one  that  shows  the  most  promise.  Is  the  mass  store  microfiche.  This  film 
chip  has  only  machine  readable  Information  except  for  an  eye-readable  title  block  a. 
the  top.  Up  to  30  million  bits  can  be  recorded  on  one  microfiche.  As  a way  of 
providing  a meaningful  comparison,  one  reel  of  magnetic  tape,  with  no  Inter-record 
gaps.  Is  capable  of  storing  100  million  bits  at  a cost  of  at  least  ten  (10)  dollars. 

In  most  real  ADP  applications,  tape  does  have  Inter-record  gaps  and  Is  seldom  filled 
from  end  to  end.  On  the  other  hand,  a HRMR  microfiche  costs  only  thirty-five  (35) 
cents  to  produce  and  has  a large  data  storage  capacity.  In  most  applications  one 
HRMR  microfiche  will  be  capable  of  storing  the  same  amount  of  data  as  one  reel  of 
magnetic  tape.  Another  advantage  of  this  approach  Is  that  data  can  be  rapidly  accessed 
(In  seconds)  In  a random  manner.  In  contrast,  retrieval  speed  for  magnetic  tape, 
measured  In  terms  of  minutes  Is  severely  limited  by  sequentially  searching  for  the 
desired  record. 

The  block  dlagreun  of  the  HRMR  system  shown  In  Figure  3 depicts  the  major  subsystems 
of  the  engineering  development  model.  The  first  Is  the  recorder  which  operates  In 
the  following  sequence.  Pre-cut  film  Is  automatically  transported  to  the  laser 
record  station  at  which  both  HR  and  MR  data  Is  recorded.  Once  this  step  Is  completed 
the  system  delivers  the  film  chip  to  an  on-line  film  processor  which  develops,  fixes 
and  dries  the  film.  The  next  station  Is  the  verifier  which  will  sequence  through  the 
entire  digital  record  to  ensure  that  all  data  Is  properly  recorded.  The  verifier  can 
also  serve  as  a backup  reader  system.  Once  outputted  from  the  verifier  a metal  clip 
Is  affixed,  a unique  Identification  number  Is  assigned  and  the  chip  Is  loaded  Into 
the  microfiche  storage  and  retrieval  module.  Also  Included  as  an  Integral  part  of 
the  automatic  retrieval  mechanism  Is  another  hologram  reader  to  provide  rapid  data 
access.  Controlling  the  entire  operation  of  the  HRMR  system  Is  a PDPll/45  computer/ 
controller.  Its  function  Is  to  maintain  proper  sequencing  of  all  system  operations 
and  to  accomplish  data  transfer  between  various  equipment  modules  and  other  systems. 

Data  recording  and  readout  Is  operated  at  500,000  bits  per  second,  which  Is  comparable 
to  high  speed  tape  systems.  Update  of  Information  recorded  on  microfiche  can  be 
rapidly  and  conveniently  achieved.  This  Is  accomplished  under  computer  control 
through  the  following  sequence:  (1)  desired  microfiche  selected  and  read;  (2)  digital 

data  transferred  to  system  controller;  (3)  entry  of  appropriate  data  changes;  and  (<() 
Initiation  of  record,  process  and  verify  operations.  Once  completed,  a new,  first- 
generation  microfiche  Is  produced  that  Is  ready  for  Inflle  within  the  storage  and 
retrieval  module. 

All  Information  (both  HR  and  MR)  Is  recorded  using  high-speed  laser  scanning 
techniques.  This  method  Is  similar  to  that  employed  by  some  of  the  more  advanced 
Computer  Output  Microfiche  (COM)  devices  currently  marketed.  The  recorder  module 
shown  In  Figure  consists  of  a low-powered,  continuous-wave  (CW)  Argon  laser. 

Output  light  Is  modulated  by  an  analog  signal  representing  the  Fourier  Transform  of 
the  Input  digital  bit  stream  to  produce  the  digital  store  holograms.  Through  a 
series  of  optical  elements  the  modulated  light  Is  properly  shaped,  focused  and  scanned 
onto  a 105  mm  x 148  mm  film  chip.  The  corresponding  human  readable  microimages  are 
recorded  In  a similar  fashion  by  deflecting  the  laser  beam  In  the  Y direction  while 
the  film  Is  transported  at  high  speed  In  the  X direction. 
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The  basic  building  block  of  the  microfiche  storage  and  retrieval  (MStR)  device 
Is  an  Image  System  "Carousel"  unit.  This  Is  an  off-the-shelf  Item  and  Is  capable  of 
holding  750  microfiche.  Image  Systems  built  and  marketed  a large  number  of  these 
units  for  various  data  retrieval  applications.  Field  performance  has  demonstrated 
their  exceptional  reliability  and  maintainability.  The  film  extraction  mechanism, 
supplied  by  Image  Systems,  will  also  be  Incorporated  Into  the  MS&R.  As  shown  In 
Figure  5,  nine  carousels  are  stacked  on  a common,  rotating  shaft.  This  provides  a 
storage  capacity  for  6750  microfiche.  Adjacent  Is  a hologram  reader  unit.  This 
device  moves  In  the  vertical  direction  and  can  access  and  extract  any  microfiche 
within  a maximum  of  15  seconds.  If  the  reader  can  Imagine  for  a moment  a device 
completely  filled  with  mass  store  microfiche  (each  storing  30  million  bits),  a 2xl0H 
bit  mass  memory  system  can  be  readily  achieved.  Whenever  Information  from  a micro- 
fiche Is  needed  by  the  computer,  a mechanical  extractor  selects  the  desired  micro- 
fiche and  transports  It  to  the  read  station.  As  user  requirements  Increase,  additional 
microfiche  storage  devices  can  be  Interfaced  to  the  system  controller  via  a highspeed 
asynchronous  communication  channel.  This  enormous  storage  capacity  makes  It  possible 
to  put  the  equivalent  of  an  entire  tape  library  on-line  to  a computer  system. 

Recovery  of  the  digital  data  stored  as  Fourier  Transform  holograms  Is  accomplished 
by  the  reader  module  shown  In  Figure  6.  Low-powered,  CW  coherent  light  Is  used  to 
Illuminate  each  hologram  track  as  It  Is  transported  at  a constant  rate  by  the  film 
transport  assembly.  Light  transmitted  through  the  hologram  Is  sensed  by  a linear 
photodetector  located  at  the  back  focal  plane.  The  data,  which  represents  the  original 
binary  data,  can  now  be  transferred  to  the  host  computer  In  response  to  a data 
retrieval  request. 

In  Figure  7,  one  of  the  Fourier  Transform  holograms  Is  shown.  Generation, 
recording  and  readout  of  these  holograms  Is  under  computer  control.  Besides  possessing 
very  high  data  storage  density,  holograms  also  demonstrate  data  redundancy  char- 
acteristics. This  property  makes  holography  highly  tolerant  to  dust,  scratches  and 
film  Imperfections.  Each  hologram  stores  approximately  6^  binary  bits  which  are 
uniformly  distributed  across  the  entire  length  of  the  hologram.  This  makes  each  data 
cell  relatively  large  (1.6  mm  x 14  um),  compared  to  the  data  cells  required  for 
direct  binary  recording  (exposure  of  a series  of  spots  representing  ones  and  zeros). 
These  large  data  cells  are  easier  to  locate  and  contribute  greatly  to  the  simplicity 
of  the  readout  process.  Since  the  data  Is  not  localized  to  one  area  but  evenly 
distributed  throughout  the  hologram,  very  low  bit  error  rates  can  be  achieved.  As 
can  be  seen  from  the  figure,  coherent  light  Illuminates  each  hologram  and  Is  recon- 
structed as  a series  of  bright  spots  at  the  back  focal  plane.  The  photodetectors 
located  at  this  position  sense  this  light  which  corresponds  directly  with  the  original 
Information. 

To  conclude  discussion  of  the  HRMR  system.  Its  key  benefits  are  outlined  In 
Figure  8.  More  Information  can  be  stored  and  maintained  using  holographic  recording 
techniques  than  can  be  achieved  using  conventional  magnetic  techniques  which  allows 
much  larger  data  bases  to  be  retained  for  on-line  computer  processing.  This  enhanced 
capability  will  greatly  Improve  the  production  of  timely  and  complete  digital  products. 
Since  all  Information  Is  recorded  on  film,  the  user  also  realizes  the  significant 
advantage  of  an  Inexpensive  storage  medium  capable  of  retaining  this  Information  as  a 
permanent,  maintenance-free  record.  With  the  dual  format  microfiche,  both  a human 
user  and  a computer  can  directly  access  desired  Information  from  a common  storage 
medium.  Finally,  by  employing  holographic  technology,  low  error  rates  and  high 
rellabllllty  can  be  achieved. 

The  use  of  holographic  techniques  for  storage  and  retrieval  of  document-oriented 
material  represents  only  one  solution  to  the  mass  memory  problem.  This  does  not 
address  the  requirement  for  the  storage  and  retrieval  of  digital  data  In  extremely 
long  records  and  at  very  high  recording  and  readout  rates,  such  as  those  presented  by 
a satellite  transmission.  Typical  requirements  extend  Into  the  1.0  to  2.0  gigabit 
per  second  (GBPS)  range  with  error  rates  not  to  exceed  1 bit  In  10^  bits  required. 
Investigation  Into  the  more  "well  established"  wideband  recording  techniques,  l.e., 
magnetic  longitudinal  and  rotary  head,  laser  and  electron  beam  and  their  predicted 
capabilities  Into  the  198O  time-frame  yield  results  similar  to  those  shown  In  Figure 
9.  This  rapid  and  continuing  growth  In  data  rates  1s  far  beyond  the  capabilities  of 
current  data  handling  equipment.  As  Illustrated,  the  more  conventional  recording 
schemes  cannot  approach  the  1.0  to  2.0  GBPS  rates  under  consideration.  To  meet  these 
demands,  Rome  Air  Development  Center  (RADC)  under  a cost-shared  Air  Force  Systems 
Command,  Directorate  of  Laboratories  and  Advanced  Research  Projects  Agency  (APSC 
(DD/ARPA)  program  has  demonstrated  a 750  megabit  per  second  (MBPS)  recording  capability 
with  possible  extension  to  2.0  gigabit  per  second  (GBPS).  The  concept  employs  a 
holographic  rather  than  blt-by-blt  laser  recording  scheme. 


RECONSTRUCTION  OF  COMPUTER  GENERATED  HOLOGRAM  CREATED 
FOR  READOUT  WITH  EXPERIMENTAL  GoAs  HOLOGRAM  READER. 

THE  BIT  SEQUENCE  SHOWN  IS  101001 101001 100001 101 1 1 10001 101 1 

Figure  7 
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WiOilAND  RECORDING  - PREDICTED  PERFORMANCE 


As  shown  In  Figure  10,  the  recording  medium  Is  35  millimeter  (mm),  silver  halide, 
photographic  film  In  roll  form.  Incoming  serial  data  Is  demultiplexed  Into  128 
parallel  channels.  One  bit  from  each  of  the  128  channels  Is  stored  In  parallel  In  a 
one-dlmenslonal  hologreun.  Holograms  are  recorded  across  the  film  width  In  rows  and 
measure  0.8  mm  In  length  on  0.016  mm  centers.  In  addition,  there  Is  a 0.2  mm  guard- 
band  between  hologram  rows.  The  active  recording  area  of  the  film  Is  24.2  mm,  with 
the  system  scanning  1512  holograms  per  row.  This  results  In  an  average  packing 
density  equivalent  to  greater  than  800  thousand  bits  per  square  centimeter. 

Figure  11  depicts  a simplified  configuration  of  the  Wideband  Holographic  Recorder. 
The  diagram  shows  the  externally  modulated  continuous  wave  (CW)  Argon  laser  used  In 
both  recording  and  readout.  The  beam- forming  optics  split  the  laser  beam  Into  both  a 
signal  and  a reference  beam.  Both  beams  pass  through  the  acousto-optic  page  composer, 
which  generates  a 128  bit  spatial  pattern  and  a 1 bit  reference  beeun.  The  faceted 
mirror  scanner  sweeps  both  patterns  across  the  width  of  the  35  mm  film  through  a 
transform  lens.  The  Transform  lens  generates  the  Fourier  Transform  hologram  of  the 
data.  This  results  In  the  Information  contained  In  the  128  data  bits  being  dis- 
tributed throughout  the  entire,  relatively  large,  area  of  the  hologram. 

After  the  exposed  film  Is  removed  from  the  transport  and  processed  using  con- 
ventional techniques.  It  Is  replaced  In  the  transport  and  Illuminated  by  the  reference 
beam  alone.  An  Image  of  the  original  data  Is  thus  formed  and  detected  at  the  photo- 
detector array.  The  data  Is  then  multiplexed  Into  the  original  data  stream. 

A holographic  recorder/reproducer  has  several  fundamental  dlferences  that  have 
proven  to  be  advantageous  over  conventional  recording  schemes.  First,  the  Incoming 
high  rate  bit  stream  Is  demultiplexed  Into  128  lower  data  rate  channels,  each  channel 
being  Introduced  to  the  system  In  parallel  at  6 MBPS.  This  relaxes  the  high-speed 
requirements  placed  upon  the  various  optical  mechemlsms  such  as  the  light  modulator, 
spinning  mirror  assembly  and  readout  devices.  The  relaxation  of  Individual  channel 
bandwldths  also  places  less  severe  constraints  on  the  film  In  terms  of  reciprocity 
characteristics. 

The  use  of  holographic  registration  on  film  reduces  the  difficulty  In  tracking 
data  across  the  width  of  the  film  on  playback.  The  task  of  tracking  a hologram  whose 
area  Is  on-the-order  of  millimeters  rather  than  a laser  spot  on-the-order  of  microns 
Is  a considerable  advantage  Inherent  to  the  system.  In  addition,  due  to  the  nature 
of  Fourier  Transform  coding,  the  data  contained  In  a particular  hologram  Is  dis- 
tributed uniformly  throughout  the  hologram.  As  a result  of  this  redundancy  and 
npntrnl  (nvnriSl'W**  iiiifiTHy,  the  dntn  is  relatively  Immune  to  media  Imperfections  such 
as  film  scratches  due  to  handling  or  bubbles  due  to  processing  variations. — 

Progress  to  date  has  been  satisfactory  and  reported  results  Impressive.  During 
January,  1976,  an  Implementation  of  an  engineering  development  model  (Figure  12) 
proved  Itself  capable  of  recording  and  reproducing  data  at  a 750  MBPS  raw  data  rate. 
This  raw  rate  Includes  error  correction  coding  which  translates  to  600  MBPS  of  user 
data.  The  playback  of  the  750  MBPS  data  produced  error-free  readouts  on  single 
selected  channels  over  approximately  200  feet  of  film.  Initial  quantitative  results 
from  analyzing  this  data  Indicates  an  overall  error  rate  In  the  1 bit  In  10^  range. 
Figure  13  summarized  the  performance  of  the  engineering  development  model  to  date. 

In  the  context  of  describing  the  HRMR  and  Wideband  Recording  progreuns,  we  have 
attempted  to  characterize  the  benefits  to  be  realized  by  applying  holography  to 
digital  data  storage.  It  Is  clear  from  the  discussion,  that  each  approach  has  been 
directed  toward  fulfilling  an  existing  operational  deficiency.  One  alms  to  provide 
ready  access  to  large  data  stores,  while  the  other  serves  to  provide  exceptionally 
high  data  recording  and  readout  performance.  Research  studies  have  documented  the 
Impressive  progress  of  holographic  data  storage.  This  progress  has  been  made  possible 
largely  through  the  efforts  of  the  RAD  community.  While  the  promise  of  these  system 
remain  high,  the  true  test  of  any  new  product  lies  not  only  In  Its  ability  to  perform 
In  the  real  world,  but  In  Its  ability  to  surpass  traditional  technologies. 


Figure  11 


Figure  13 
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MULTIMODE  NETTING  BY  WIDEBAND  CABLE 

VICTOR  A.  DEMARINES  and  GEORGE  A.  FAGAN 
The  MITRE  Corporation 
Bedford,  Maeaachuaetta,  U.  S>A.  01730 


SUMMARY 

The  rapid  growth  of  low  coat  digital  circuitry  ia  affecting  all  forma  of  electronic  technology. 

Because  cheap  and  reliable  devices  are  becoming  more  and  more  available,  the  concept  of  information 
processing  is  altering.  Digital  processing  is  still  important,  but  is  being  augmented  by  analog  devices  - 
video  and  audio  recordings,  microfiche,  etc.  Digital  processing  itself  is  altering,  as  distributed  pro- 
cessors and  smart  terminals  became  conunon.  Conventional  communicatioas  technology  does  not  adapt 
well  to  the  new  demands.  The  multimedia  communications  bus,  based  upon  current  television  coaxial 
cable  technology,  offers  an  economical,  reliable  and  flexible  alternative. 


INTRODUCTION 

It  is  hardly  necessary  to  state  that  the  fleld  of  information  processing  is  changing  - fliat  seems  to 
have  been  true  for  flie  last  twenty  years  - and  may  well  be  true  for  the  next  fifty.  Many  of  the  changes 
are  doe  simply  to  developing  technology,  but  beyond  that,  there  are  changes  in  concept,  changes  in  the 
way  people  think  of  information  systems,  that  will  have  significant  effects  upon  the  architecture  of 
systems  in  the  future.  Because  military  organizations  have  been,  and  probably  will  continue  to  be, 
leaders  in  systems  development  and  application,  they  are  among  the  first  to  feel  the  impact  of  the 
changes. 

In  the  first  place,  the  concept  of  information  processing  has  broadened  considerably.  Often,  in  the 
past,  the  term  was  used  almost  as  a synonym  for  digital  data  processing,  with  a possible  extension  to 
include  on-line  retrieval  and  manipulation.  The  link  with  the  digital  computer  is  still  there,  but  non- 
digital forms  are  also  being  recognized  as  useful  and  necessary  parts  of  an  information  system.  We 
now  find  the  various  forms  of  voice  communications  - telephone,  radio,  secure  or  open,  being  con- 
sidered as  part  of  an  integrated  design  rather  than  being  accepted  as  separate  and  independent  systems. 

Television  also  is  starting  to  assume  a significant  role,  not  merely  for  entertainment,  but  in  such 
varied  applications  as  surveillance,  teleconferencing,  remote  access  to  documents,  and  interactive 
education  systems.  Digital  computer  systems  and  digital  communications  are,  of  course,  e*^U 
important,  but  they  are  no  longer  the  sole  concern  of  the  information  system  designer. 

Secondly,  the  development,  during  the  last  few  years,  of  low  cost  digital  devices  is  changing  the 
approach  to  computing.  Computing  power  is  no  longer  rare  and  expensive  and  the  designer  is  no  longer 
forced  to  think  in  terms  of  nnaximizing  flie  utilization  of  an  expensive  central  computer  facility.  Mini- 
computers and  smart  (in  some  cases,  brilliant)  terminals  can  be  dispersed  throughout  a facility  for 
individual  use.  The  distribution  of  processing  and  the  resulting  need  for  remote  access  to  data  bases 
that  may  themselves  be  physically  dispersed  creates  new  problems  of  communication.  A great  deal  of 
effort  is  therefore  being  pot  into  the  development  of  effective  computer  netting  systems,  and  though 
these  may  reduce  the  inefficiencies  and  delays  which  have  occurred  in  large  time- sharing  systems, 
they  place  additional  strain  on  the  communications  system,  because  large  portions  of  the  data  base  may 
be  shared  by  many  devices.  The  volume  of  digital  traffic  between  dispersed  computing  facilities  and  a 
logically  central  data  base  may  be  considerably  greater  than  that  between  dispersed  terminals  and 
central  computing  facilities. 

One  of  the  most  revolutionary  effects  of  the  ready  availability  of  new  technology  is  that  the  con- 
ventional paper  systems,  on  which  most  of  the  world's  organizations  depend,  can  now  be  seen  to  be 
wasteful  of  manpower,  materials  and  space.  In  the  past  few  years  the  same  pattern  has  emerged  in 
most,  if  not  all,  of  the  industrialized  countries;  mail  services  have  become  less  efficient,  less  timely, 
and  more  expensive,  while  the  scope  of  services  has  been  reduced.  In  the  case  of  national  postal  | 

services,  the  results  are  clear  for  all  to  see;  similar  problems  occur,  but  are  not  so  visible,  in  the  ' 

internal  operations  of  large  industrial  companies  and  military  organizations.  Various  forme  of  j 

electronic  mail  systems  are  therefore  being  considered  and  evaluated.  These  include  for  example  ] 

digital  message  transmission,  facsimile,  video,  remotely  accessed  micro-  and  ultra-fiche,  etc. 

The  range  of  possibilities  is  great,  but,  in  any  organization  where  the  volume  of  internal  mail  ia  j 

sufficient  to  justify  an  electronic  distribution  system,  the  required  number  of  terminals  and  the  volume  j 

of  electronic  traffic  would  be  very  large. 

Historically,  and  conventionally,  communications  systems  have  been  developed  and  added  to  the 
existing  base  - telegraph  was  added  to  mail,  telephone  to  telegraph,  then  radio,  flien  television,  then 
digital  transmission.  Each  different  type  of  service  tends  to  have  its  own  physical  plant  and  support 
organization  and  generally  one  of  the  ground  rules  for  adding  a new  service  is  that  existing  services 
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mu«t  not  be  modified  or  impaired  in  any  way  that  is  visible  to  the  user.  This  approach,  though  una- 
voidable, has  led  to  the  creation  of  multiple  services,  sometimes  redundant,  sometimes  wasteful  of 
resources,  and  often  so  inflexible  as  to  Inhibit  and  obstruct  the  introduction  of  new  features.  The 
multiplicity  of  separate  services,  which  are  competitive  rather  than  cooperative,  is  not  compatible  with 
the  integrated  information  system;  a comprehensive  and  flexible  system  will  be  needed  to  support  the 
emerging  applications.  This  system  must  provide  the  ability  to  transmit  information  in  a variety  of 
forms,  among  a large  number  of  users,  in  timely  fashion. 

The  concept  that  is  being  used  and  developed  at  MITRE  draws  upon  two  technologies,  cable  com- 
munications and  low  cost  digital  electronic^  to  support  multimode  "bus"  systems.  The  cable  is  a con- 
ventional coaxial  cable  as  is  used  for  closed  circuit  or  community  antenna  television;  the  modes  include 
voice,  video,  audio,  and  digital  transmissions;  the  terminal  devices  include  telephones,  television 
cameras  and  monitors,  teletypes,  cathode  ray  displays,  computers,  printers,  and  entry  control 
devices  all  attached  to  and  communicating  through  a single  coaxial  cable  distribution  system.  The  term 
"bus"  implies  that  all  information  is  available  to  all  participants.  The  advantages  of  such  a system  are; 

flexibility  - new  services  and  new  subscribers  can  be  added  easily, 
cost  - not  only  is  the  installation  cost  much  lower  than  that  of  the  conventional 

separate  systems,  but  the  interfacing  devices  are  also  inexpensive. 

At  present  MITRE  is  involved  in  experiment,  application  and  system  design  using  this  type  of  multi- 
mode  communications  for  a number  of  Department  of  Defense  and  other  sponsors.  We  are  developing 
the  concepts  and  devices  for  application  to  Army  Base  systems,  large  government  agencies.  Air  Force 
Command  Centers,  and  hospitals,  but  we  can  foresee  their  utilization  in  a great  variety  of  communities 
where  there  are  significant  problems  in  maintaining  adequate  communications.  Military  and  govern- 
mental organizations  have  such  problems,  but  they  are  not  alone;  industrial  corporations,  financial 
institutions,  manufacturing  plants,  and  universities  have  similar  problems  and  could  use  similar 
solutions. 

The  pressures  created  by  the  rapid  growth  in  electronic,  and  particularly  digital,  technology  are 
forcing  a reappraisal  of  the  conventional  communications  structure.  Cable-contained  bus  communi- 
cation is  a vehicle  that  is  needed  to  support  new  applications  economically  in  a ulde  variety  of  closed 
communities,  and  we  believe  that  there  will  be  tremendous  increase  in  its  use  in  the  next  few  years. 


CABLE  TECHNOLOGY 

The  notion  of  coaxial  cable  as  a multimode  transmission  medium  tends  to  be  distorted  by  the  under- 
lying  association  with  CATV  - Community  Antenna  Television  - and  the  feeling  that  the  cable  is  some- 
how "full"  of  television  channels.  To  dispel  that  idea  we  must  consider  the  cable  simply  as  a radio 
frequency  medium.  It  has  a frequency  dependent  attentuation  characteristic  but  despite  this,  common 
commercial  practice  provides  a usable  bandwidth  up  to  300  megahertz,  and  in  some  experimental  appli- 
cations, as  much  as  900  MHz  has  been  used.  As  the  cable  is  shielded,  it  is  virtually  free  from  inter- 
ference, Therefore  the  cable  provides  a capability  that  can  be  employed  in  a wide  variety  of  ways  just 
as  radio  communication  can  be  employed.  Frequency  bands  can  be  allotted  to  specific  purposes  and 
within  those  bands  well  known  modulation  and  multiplexing  techniques  can  be  employed. 

There  is  nothing  strange  or  novel  about  the  use  of  wideband  cable.  The  research  and  development 
was  done  long  ago,  and  the  devices  that  are  required  to  support  wideband  transmission  over  long  cable 
lengths  are  available  from  stock  from  a number  of  suppliers.  The  cable  television  industry  maintains 
a standard  80  dB  isolation  for  all  components  of  the  distribution  network.  This  means  that,  in  ex- 
ceptionally noisy  environments,  such  as  manufacturing  plants  with  much  heavy  equipment  generating 
electromagnetic  interference,  the  noise  level  in  cable  is  low  enough  to  provide  excellent  performance. 
The  practice  in  the  industry  is  to  maintain  an  adequate  signal  level  by  Installing  frequency  compensated 
linear  repeaters  at  distances  along  the  cable  corresponding  to  22  dB  attenuation  at  the  highest  frequency 
of  interest;  typically,  a repeater,  which  coats  about  $300,  is  installed  every  1000  feet  (300  metres). 
Cable  itself  costs  about  10  cents  per  foot  (33  cents  per  metre),  and  installafion  cost,  when  the  cable  and 
repeaters  are  hong  on  existing  telephone  poles,  is  about  four  times  the  cable  cost.  An  average  figure 
for  the  total  installed  cost  of  a cable  and  repeaters  is  therefore  about  80  cents  per  foot  ($2.65  per 
metre),  but  this  will  of  course  vary  from  one  installation  to  another.  For  two  way  transmission,  it  is 
convenient,  but  not  essential,  to  install  a pair  of  cables;  because  of  the  installation  cost,  udiich  is  not 
doubled  if  two  cables  are  hung  at  one  time,  such  an  arrangement  would  cost  about  $1,  30  per  foot 
($4.  30  per  metre). 

The  technique  for  adding  subscribers  to  a cable  is  also  well  established  - simple  cable  taps  are  used 
and  all  subscribers  potentially  have  access  to  all  the  information  on  the  cable;  actually,  access  to 
services  is  limited  by  the  type  of  interfacing  equipment  supplied  to  the  subscribers,  just  as  is  the  case 
for  broadcast  radio.  Controls  are  obviously  needed  to  prevent  unauthorised  access  uitich  can  range 
from  theft  of  a service  (e.  g.  television,  or  computer  time)  to  disclosure  of  classified  data.  If  file 
potential  risks  are  recognized  during  the  design  phase,  protection  can  bo  built  into  the  system;  the  real 
dangers  are  therefore  less  severe  than  they  at  first  appear. 


The  costs  end  the  performance  of  this  type  of  cable  installation  have  been  well  established  by 
several  years  of  practical  field  experience  of  cable  television  in  many  cities  in  both  the  United  States 
and  Europe.  Costs  are  low,  reliability  is  high,  performance  is  excellent.  The  availability  of  the  radio 
frequency  spectrum  is  illustrated  by  Figure  1,  which  shows  the  signal  spectral  pattern  in  use  on  a multi- 
mode  caUe  at  MITRE.  This  installation  is  providing  television  (both  commercial  and  closed  circuit), 
frequency  modulated  commercial  radio,  voice  telephone,  and  digital  services,  simultaneously. 


Figure  1.  Frequency  Spectrum  of  the  MITRE  Demonstration  System  Cable 


The  provision  of  cable  plant  in  a community  not  only  will  support  a wide  variety  of  services,  but 
also  makes  it  possible  to  use  a single  control  facility  for  fault  detection  and  isolation.  Also  it  makes  it 
possible  for  any  user  to  connect  to  the  information  system,  anywhere  in  the  community  at  little  more 
than  a moment's  notice,  for  only  a simple  tap  is  needed.  In  the  remainder  of  this  paper  we  propose  to 
discuss  briefly  fte  conventional  services,  and  to  dwell  in  some  detail  on  the  digital  data  communications 
system  in  use  at  MITRE. 


CABLE  TELEVISION  SERVICES 

By  far  the  most  commonly  known  use  of  coaxial  CATV  cable  is,  of  course,  for  the  transmission  of 
television.  Such  services,  u^ether  for  off-the-air  television,  closed  circuit  television  (e.  g.  for  con- 
ferencing or  surveillance  systems)  and  even  for  many  computer  assisted  instruction  applications,  can 
be  accommodated  in  4.  5 megahertz  bandwidth  within  6 MHz  channels.  Twenty  such  channels  could 
therefore  be  available  on  the  cable  and  use  only  120  MHz  of  the  available  300  MHz  bandwidth  - obviously 
there  is  room  for  other  services. 

Some  television- like  services  require  higher  resolution  and  therefore  more  bandwidth  than  con- 
ventional television.  For  instance,  the  multicharacter  video  display  terminals  that  are  used  in  some 
computer  aided  instruction  applications  may  require  15  MHz  channels.  Dissemination  of  microfiche 
for  remote  viewing  requires  extremely  high  resolution  to  accommodate  the  fine  print  and  line  widths  of 
many  documents,  and  could  require  100  MHz  or  more  for  continuous  transmission.  Fortunately,  these 
types  of  application  rarely  need  continuous  transmission.  The  viewer  normally  spends  a considerable 
time,  perhaps  several  minutes,  studying  a single  image  so  file  strain  on  the  transmission  can  be 
reduced  by  providing  local  storage  devices  (so-called  frame  grabbers)  from  which  the  image  can  be  re- 
produced at  file  remote  display  terminal,  in  udiich  case  the  high  resolution  bandwidth  can  be  dedicated 
to  a user  for  only  one  frame  tranmission  time  of  about  30  milliseconds,  to  fill  the  storage  memory. 

In  addition,  if  slow  scan  techniques  are  used,  the  bandwidth  reqalreinent  can  be  reduced,  though  the 


frame  tranemiasion  time  would  be  increased  proportionately,  and  therefore  the  number  of  users  who 
could  be  serviced  on  a single  channel  in  a given  interval  would  also  be  reduced. 

Obviously  this  kind  of  system  offers  the  designer  room  for  compromise.  He  can  dedicate  the  band- 
width to  a single  user  or  to  multiple  users  who  want  to  view  the  same  frame;  he  can  frequency  divide  the 
band  into  several  narrower  channels  and  serve  a number  of  users  either  with  less  resolution  or  with 
slower  response;  he  can  time  divide  the  band,  or  the  separate  channels,  and,  if  necessary,  address 
selected  single  frames  to  individual  storage  devices. 

The  various  techniques  place  different  demands  upon  the  signal  transmission  and  reception  equip- 
ment, but  any  and  all  can  be  supported  by  a single  coaxial  cable.  The  selection  of  the  appropriate  com- 
bination of  techniques  to  satisfy  a given  organizational  need  can  be  based  upon  well  understood  tradeoffs 
between  coat  and  capability. 


VOICE  TELEPHONE  SERVICES 

In  the  conventional  telephone  system,  each  subscriber  has  a pair  of  wires  that  run  from  his  instru- 
ment to  a control  exchange  where  switched  connections  are  made  in  response  to  dial  signals.  In  all  but 
the  most  modern  exchanges,  switching  is  a mechanical  operation,  and  the  arrangement  is  relatively 
inflexible;  if  a subscriber  changes  his  location  the  bid  wires  must  be  physically  disconnected  and  the  new 
ones  connected  to  the  appropriate  terminals  in  the  exchange.  The  number  of  subscribers  who  can  be 
supported  is  limited  to  the  numerical  capacity  of  the  exchange  equipment.  Additions  are  expensive  and 
time  consuming,  and  yet,  because  most  telephones  are  used  for  at  most  a few  hours  per  day,  most  of 
the  equipment  and  lines  are  idle  for  the  greater  part  of  their  working  lives. 

A single  coaxial  cable  can  be  used  as  the  only  signal  path  to  service  a telephone  system.  In  one 
operating  commercial  system,  up  to  3000  telephones  can  be  supported;  more  are  obviously  possible  if 
bandwidth  is  allocated.  In  this  approach,  frequency  division  multiplexing  is  used  to  connect  different 
subscribers.  Switching  is  performed  by  frequency  selection  within  the  telephone  instruments,  under  the 
control  of  a central,  stored  progratm  minicomputer  that  allocates,  by  digital  switching,  the  calling  and 
called  instruments  to  any  vacant  channel.  Each  channel  is  the  analog  of  a wire  pair,  but  communications 
privacy  and  non-interference  are  obtained  by  frequency  division  rather  than  spatial  separation.  A 
channel  is  dedicated  to  a pair  (or  group,  if  the  conferencing  capability  is  being  used)  of  users  only  for 
the  duration  of  the  call;  it  is  then  available  for  use  by  another  pair,  so  the  common  equipment  can  have 
a much  higher  duty  cycle,  thus  giving  a better  return  on  the  investmc  it  than  conventional  switched  wire 
systems. 

Adding  new  subscribers  to  the  system  simply  requires  taps  into  the  cable,  and  a variety  of  useful 
services  can  be  provided  by  programming  in  the  minicomputer. 


DIGITAL  COMMUNICATIONS 

It  is  fairly  obvious  that,  if  the  bandwidth  of  the  cable  can  be  separated  into  discrete  channels  and 
modulated  to  carry  video  or  audio  signals,  then  digital  messages  can  also  be  carried  by  frequency  shift 
keying  and  similar  methods.  Conventional  techniques,  however,  have  the  disadvantage  that,  if  a very 
high  data  rate  channel  is  required  for  a short  time,  a disproportionate  amount  of  bandwidth  must  be 
allocated  to  it  - that  is,  other  services  may  be  impaired  because  of  the  need  to  keep  a fixed  capacity 
available,  even  though  it  may  be  used  infrequently,  and  the  user  pays  either  directly  or  indirectly  for 
the  capacity  available  rather  than  the  service  used. 

At  MITRE  an  attempt  has  been  made  to  solve  some  of  the  problems  by  developing  a flexible  system  in 
which  the  available  capacity  can  be  reallocated  according  to  current  needs.  This  is  a transparent  high- 
speed burst  transmission  system  that  is  designed  to  provide  fast  access  by  many  users  to  variable  data- 
rate  digital  channels.  The  system  could  operate  over  any  wideband  medium,  but  in  the  demonstration 
system  a coaxial  cable  is  used. 


MITRIX  DEMONSTRATION  SYSTEM 

MITRIX  is  a high-speed  burst  transmission  system  operating  on  a pair  of  cables  - inbound  and  out- 
bound (Figure  2).  In  the  demonstration  system  at  MITRE,  the  digital  communications  system  occupies 
only  a very  small  fraction  of  the  cable  bandwidth  - about  one  MHz,  but  this  is  sufficient  to  support  an 
aggregate  data  transmission  rate  exceeding  600  Kbps. 


Figure  2.  MITRIX  Functional  Configuration 


The  channel  carries  a serial  bit  stream  that  is  divided  into  frames  of  fixed  time  (and  data)  length. 
Each  of  these  frames  is  divided  into  a sequence  of  fixed  length  data  slots.  The  system  operates  as  a 
time  division  system  in  which  a user  is  allocated  certain  data  slots  in  each  frame  for  transmission. 
Allocation  of  the  slots  is  controlled  by  a network  control  center  according  to  the  data  rate  needs  of  the 
sender.  For  example  in  the  demonstration  system,  an  allocation  of  one  slot  per  frame  (i.  e.  a 256  bit 
burst  containing  192  information  bits  in  each  2.  56  second  frame)  corresponds  to  a data  rate  of  75  bps. 
Slots  may  be  allocated  in  multiples  of  2N,  thus  permitting  communication  at  150,  300,  etc.  to  a maxi- 
mum of  307,200  bps.  The  allocations  remain  in  effect  only  until  the  end  of  a transmission,  when  the 
slots  are  released  and  may  be  reallocated  to  a variety  of  other  senders. 

As  this  is  a bus  system,  every  user  receives  all  data  slots.  Selection  is  performed  by  the  receiving 
equipment  which  uses  part  of  the  data  in  each  slot,  the  address,  to  identify  the  slots  intended  for  it.  The 
address  can  designate  a single  user,  thus  effectively  giving  point-to-point  communication,  but  may  also 
indicate  a class  of  users  (including  "all")  and  thus  it  is  possible  to  distribute  data  in  conference  or 
broadcast  modes  or  to  poll  a group  of  users.  As  a receiver  selects  slots  of  interest  from  all  on  the  bus, 

' it  is  obviously  possible  that  the  receiver  may  receive  and  process  several  messages  at  different  data 

rates,  simultaneously. 

^ The  interface  between  the  user  and  the  cable  is  the  bus  interface  unit  (Figure  3),  a rather  sophisti- 

I cated  device,  but  one  that,  because  of  the  availability  of  low  cost  digital  integrated  devices,  need  not  be 

expensive.  The  bus  interface  units  are  connected  at  each  user's  location,  between  the  inbound  2tnd  the 
outbound  cable,  and  each  is  also  connected  to  one  or  more  terminal  devices.  The  unit  accepts  infor- 
mation from  an  input  device,  reorganizes  the  data  into  suitably  addressed  packages,  and  inserts  them 
into  the  inbound  cable  in  the  assigned  time  slots.  The  serial  bit  stream  formed  by  all  the  active  bus 
interface  units  passes  through  a digital  bus  repeater  on  to  the  outbound  cable,  from  which  individual 
packages  are  selected,  according  to  address,  by  other  bus  interface  units.  The  address  information  is 
then  removed  and  the  data  that  was  originated  by  the  input  device  is  reconstituted  and  passed  to  the 
^ receiving  device. 

The  Network  Control  Center  is  a special  terminal  device  which  performs  three  principal  functions: 
system  initialization  and  start  up,  changes  to  slot  allocations  to  accommodate  different  data  rates,  and 
V monitoring  and  fault  isolation.  The  Network  Control  Center  can  be  programmed  to  detect  terminal 

device  failures  and  to  reallocate  some  of  the  functions  of  a failed  unit  to  a substitute  device.  Therefore, 
^ in  systems  in  which  there  is  duplication  of  equipment,  the  system  can  be  protected  against  catastrophic 

failure  by  reconfiguring  the  network  for  continued,  even  though  degraded,  operation. 
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Figure  3.  Bua  Interface  Unit  - Functional  Diagram 


The  demonetration  system  operates  only  one  digital  bus,  at  about  600  Kbps  aggregate  data  rate,  but 
even  this  can  acconunodate  8000  users  at  the  minimum  data  rate  of  75  bps  or,  for  example,  64  users 
each  sending  simultaneously  9600  bps.  However,  this  bus  occupies  only  1 MHz  of  the  cable's  bandwidth. 
Obviously,  the  capacity  could  be  multiplied  by  assigning  more  bandwidth  to  a single  channel  so  as  to 
create  a higher  speed  bus,  or  by  assigning  more  channels  to  mtiltiple  busses.  In  the  latter  case,  there 
is  then  an  option  either  to  arrange  intercommunication  between  the  busses  or  to  isolate  some  busses 
from  others  for  privacy  or  security.  The  bus  is  a transparent  medium,  and  data  security  can  be 
obtained  by  employing  encryption  devices  between  terminal  devices  and  their  associated  bus  interface 
units. 


APPLICATIONS 

At  the  beginning  of  this  paper  it  was  stated  that  information  systems  are  now  being  viewed  in  a 
comprehensive  fashion  which  includes  audio,  video,  and  digital  means.  Some  of  the  techniques  in  which 
these  means  may  be  acconrunodated  on  a television  coaxial  cable  have  been  discussed,  and  it  was  pointed 
out  that  the  cable  is  merely  a radio- frequency  transmission  duct;  therefore  any  technique  that  can  be 
used  with  radio  can  be  used  on  the  cable  (Figure  4).  However,  the  cable  is  essentially  free  of  inter- 
ference and  can  sustain  a high  signal-to-noise  ratio.  Moreover,  the  cable  leads  the  signal  where  it  is 
wanted  and  puts  the  whole  capacity  of  the  300  MHz  bandwidth  at  the  disposal  of  each  one  of  the  using 
group.  The  bus  architecture  and  a MITRIX-like  time  division  multiple  access  organization  provides  an 
extremely  versatile  capability  that  will  support  complex  multimode  systems. 

Cable  may  not  be  economical  for  systems  where  users  are  tightly  grouped,  for  example,  immedi- 
ately around  a computer  room  or  in  a number  of  widely  separated  tight  groups;  in  these  cases  point-to- 
point  connections,  or  a high-speed  parallel  bus  may  be  appropriate. 

For  operation  over  long  distances,  the  cost  of  laying  the  cable  and  installing  the  amplifiers  may 
exceed  that  of  microwave  and  other  long  distance  systems.  Fven  if  cable  is  economically  feasible, 
there  may  be  additional  problems  of  obtaining  accei>s  to  existing  poles,  or  obtaining  rights  of  way. 

At  present,  the  most  appropriate  uses  of  cable  appear  to  lie  in  large  buildings  or  sp-^tially  compact 
areas  where  there  is  a clear  need  for  a wide  variety  of  communications  services.  Examples  are 
military  bases  and  government  office  buildings,  and  in  the  private  sector,  hospitals,  large  insurance 
company  headquarters,  highly  automated  manufacturing  complexes  and  similar  organizations.  In 
particular,  if  the  community  is  already  provided  with  a coaxial  cable  for  entertainment  television  or  for 
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a aurveillance  system,  the  use  of  the  existing  cable  to  support  other  services  becomes  even  more 
attractive. 
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Figure  4.  Demonstration  System  Circuit  Configuration 

For  systems  with  a very  small  number  of  users,  the  cable  bus  offers  no  special  advantages.  For 
as  few  as  20  - 40  users  (for  example,  large  computers,  distributed  terminals  or  minicomputers,  etc.) 
the  economics  favor  the  cable  bus.  The  crossover  point  will  depend  upon  many  factors,  including  the 
nature  of  the  existing  facilities,  the  complexity  of  the  interconnections  required,  and  the  physical 
distances  involved. 
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Considering  only  a low  or  medium  speed  digital  communioations  subsystem,  and  restricting  the 
analysis  to  communications  equipment  alone,  the  expected  costs  are  shown  in  Figure  5.  This  diagram 
shows  the  effect  of  adding  equipment  to  a basic  front  end  processor  as  the  number  of  subscribers  is 
increased.  In  the  conventional  approach  the  additions  are  line  controllers,  which  handle  16  line  incre- 
ments, modems  and  couplers.  In  the  cable  bus  approach  a cable  distribution  system,  a network 
control  computer,  and  bus  interface  units  would  be  required.  For  costing,  a 1.  S mile  cable  run  has 
been  assumed  to  be  necessary;  the  modem  costs  assume  a representative  mix  of  data  rates  on  voice- 
grade  telephone  lines,  but  the  telephone  equipment  costs  were  not  included.  Also,  the  cost  of  providing 
high  data  rate  communications  in  the  conventional  system  have  been  excluded;  this  capability  is,  of 
course,  an  intrinsic  feature  of  the  MITRIX  time  division  design  and  adds  no  cost  to  the  cable  system. 
Each  installation  is,  of  course,  different,  but  the  figure  is  representative  of  cost  studies  performed 
by  MITRE  for  specific  installations  and  itlustrates  the  attractiveness  of  the  cable  bus  approach. 

Not  the  least  important  feature  of  the  bus  architecture  is  that  the  number  of  connections  and  the 
mass  of  wire  that  are  employed  is  much  less  than  those  in  conventional  systems.  To  connect  N 
terminals  to  a switch  requires  2N  connections  - to  connect  them  to  a bus  requires  only  N+1  (one  for  the 
network  controller).  However,  the  bus  provides  unlimited  connectivity,  which,  in  the  conventional 
approach  requires  N(N-l)  connections  per  circuit.  In  applications  such  as  mobile  tactical  systems,  in 
which  communications  must  be  disconnected  before  each  move,  and  reconnected  afterwards,  the  re- 
duction in  work  and  in  potential  error  or  mechanical  damage  is  obviously  advantageous.  In  the  case  of 
the  AN/TSQ-91  Control  and  Reporting  Center  used  by  Tactical  Air  Command,  it  has  been  determined 
that,  if  the  bus  architecture  had  been  used,  instead  of  point-to-point  communications,  the  number  of 
circuit  connections  would  be  26  ii.'*'ead  of  2800,  only  13  cables  would  have  been  required  instead  of  95, 
and  the  weight  of  cables  would  have  been  reduced  by  over  two  tons. 


i^r.Tyt-rrfy.ffFJt3;s 


\000  PER  TERMINAL 


w 


BUS  INTERFACE  UNITS  ^ 
$500  PER  terminal 


NETWORK  CONTROL  MINICOMPUTER  - $20,000  C 


1.5  MILE  CATV  OISTReUTION  NETWORK  - $15000 


PROCESSOR  - $800^ 


NUMBER  OF  SUBSCRIBERS 


NUMBER  OF  SUBSCRIBERS 


CONVENTIONAL  APPROACH 
(ASSUMES  TELEPHONE  CIRCUITS  USED  AT  NO  COST) 


BUS  APPROACH 


Figure  5.  Cost  Summary  - Conventional  vs.  Cable  Bus  Systems 

The  demonstration  system  at  MITRE  has  shown  that  the  use  of  cable  to  support  a multimedia  en- 
vironment is  practical  and  economical.  For  the  future,  several  potential  applications,  some  of  which 
are  discussed  below,  are  being  pursued. 


Army  Base  Information  Transfer  System  (ARBITS) 


The  Army  Base  Information  Transfer  System  is  a major  study,  supported  by  the  U.  S.  Army  and 
being  conducted  by  MITRE, which  is  aimed  at  the  implementation  of  integrated  communications- 
electronics  systems  to  satisfy  the  comprehensive  information  processing  and  transfer  requirements  of 
major  Army  installations. 

It  is  proposed  to  continue  practical  studies  of  the  applicability  of  the  multimedia  bus  architecture  in 
a test  bed  facility  at  Fort  Bliss,  Texas.  Extensive  studies  of  this  base  indicate  that  the  technology 
should  prove  beneficial  in  several  applications  areas,  such  as: 

Remote  entry  and  retrieval  of  digital  data. 

Forms  processing,  text  editing,  correspondence  and  message  distribution. 

Data  base  management,  computer  programming,  and  report  generation. 

Computer-assisted  instruction,  computer  graphics. 

Digital  telemetry  and  control. 

Computer  output  microfilm,  remote  retrieval  and  update  of  micro- form  data,  micropublishing. 

Facsimile  transmission. 

Fixed-frame  and  broadcast  video,  interactive  access  to  video,  surveillance  and  teleconferencing. 

Secure  processing  of  voice  and  digital  messages. 

Switchless  telephony. 

Linking  of  telephone  system  to  non-tactical  radio  nets. 


Government  Classified  Facilit 


The  application  of  a multimedia  cable-based  information  distribution  system  has  been  studied  for  a 
classified  headquarters  facility.  The  study  shows  that  the  expressed  requirement  for  five  categories  of 
information  transfer  can  all  be  supported  simultaneously  by  a single  coaxial  cable-based  network. 

These  comprise: 


Digital  communications  among  computer  terminals  and  computer  complexes  (six  IBM 
370-168  or  larger  computers  and  over  700  terminals). 

Document  distribution  communications  - distribution  of  document  images  originating 
in  a microfiche  scanning  facility  to  800  remote  terminals. 

Other  digital  requirements,  including  inter-building  data  transfer  and  high-speed 
facsimile. 

Communications  among  7000  secure  telephone  sets. 

Miscellaneous  requirements,  such  as  high  bandwidth  image  transfer,  standard  television 
video  transmission  from  the  air  or  tapes. 

The  digital  requirements  can  be  accommodated  within  less  than  10  MHz  bandwidth  and  will  operate 
at  7.4  Mbps  with  about  50%  reserve  capacity.  This  capability  is  within  the  current  capabilities  of  low 
cost  digital  devices. 


University  of  Vermont 

The  PRoblem  Oriented  Medical  Information  System  (PROMIS)  that  is  being  developed  at  the  Uni- 
versity of  Vermont  requires,  among  other  things,  high-speed  digital  data  communications  to  ensure 
that  the  computer  display  terminals  are  responsive  to  the  needs  of  the  medical  staff,  MITRE  has 
designed  and  built  the  prototype  modems  for  a 300  Kbps  cable-based  system.  This  is  a polling  system 
and  the  prototypes  will  be  used  for  proof- of- concept  testing  of  the  techniques.  Low  cost  digital 
circuitry  was  used  for  the  modems. 


Modular  Control  Center 

A study  at  MITRE,  for  the  United  States  Air  Force,  is  concerned  with  the  development  of  a system 
concept  employing  interconnected  standard  modules  to  support  command  and  control  functions.  When 
successful,  this  approach  could  reduce  the  difficulties  of  acquisition,  logistic  support,  and  maintenance 
of  equipment,  while  providing  redundancy  which  will  facilitate  the  creation  of  fail- soft  systems. 

Studies  have  shown  that  a cable  bus  acting  as  the  interconnecting  vehicle  would  reduce  weight,  save 
set-up  time,  improve  reliability,  increase  flexibility  and  facilitate  reconfiguration. 


CONCLUSIONS 

Traditional  communications  architectures  are  not  adequate  to  support  the  transparent,  efficient- 
expandable  and  easily  integrable  networks  that  are  needed  to  exploit  new  information  system  concepts. 
New  ideas  are  needed  which  will  create  new  structures  for  distributing  information,  ideas  that  will  let 
us  tie  things  together  in  easier,  cheaper  and  more  flexible  ways  than  the  usual  point-to-point,  switched 
lines. 

Computers  are  no  longer  limited  to  expensive  central  facilities.  The  minicomputer  and  the  micro- 
processor are  making  it  possible  to  put  substantial  computing  power  into  many  more  hands,  and  to 
bring  the  computer  to  the  user,  rather  than  the  reverse.  There  is  a strong  thrust  towards  distributed 
processing,  dispersing  work  in  specialized  machines  at  many  locations.  Efficiency  and  accuracy  can 
be  maintained  only  if  Aese  processors  can  be  interconnected  economically. 

In  military  systems,  the  availability  of  extremely  capable  but  inexpensive  digital  devices  gives  the 
power  to  develop  systems  of  unprecendented  capability  and  survivability.  It  will  be  possible  to  con- 
struct systems  from  smaller,  cheaper  building  blocks  that  can  be  put  together  in  more  flexible,  more 
reliable  and  more  survivable  systems  - but  only  if  we  have  more  flexible  communications  to  support 
them. 

Digital  systems  are  not  the  whole  story.  Information  systems  are  being  built  which  exploit  the 
whole  range  of  information  handling  techniques  - not  only  digital,  but  video,  audio  and  photographic 
recording,  retrieval  and  distribution.  Such  systems  require  a comprehensive  communications  system 
to  support  them. 

Coaxial  cable  systems,  using  multimedia  bus  technology,  can  provide  the  required  communications 
support.  The  bandwidth  is  sufficient  to  support  extremely  demanding  information  transfer  require- 
ments. The  technology  is  well  proven;  reliable,  standard  commercial  equipment  has  been  available 
for  several  years  to  support  high  signal-to-noise  ratio,  interference  free  transmission.  Digital  appli- 
cations are  also  well  proven,  but  their  use  has,  until  recently,  been  inhibited  by  cost  considerations. 
However,  the  recent  growth  of  low  cost  digital  integrated  devices  has  removed  the  cost  barrier  - the 
economics  now  favor  the  digital  cable  for  many  installations.  The  costs  of  digital  logic  are  expected 
to  continue  to  fall,  whUe  there  is  every  indication  that  installation  and  maintenance  costs,  which  both 
have  heavy  manpower  components,  will  rise.  The  cost  tradeoff  should  therefore  be  even  more 
favorable  in  the  future. 
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TERMINAL  ACCESS  TECHNOLOGY  OF  THE  1990s 
by  CRAIG  FIELDS* 

INTRODUCTION 

I am  going  to  make  some  predictions  about  characteristics  of  terminal  access  tech- 
nology of  the  1990s.  The  usual  way  to  predict  future  technology  Is  to  forecast  what 
will  be  technically  possible.  This  Is  an  unsatisfactory  approach,  because  Che  number 
of  technological  possibilities  for  the  1990s  Is  truly  vast,  and  most  will  never  reach 
the  market  place,  much  less  become  popular  products  In  common  use.  The  technology  of 
terminals  In  Che  1990s  will  be  determined  by  Che  R&D  Investment  strategy  pursued  by 
Che  computer  Industry  during  the  next  two  decades.  This  Investment  strategy.  In  turn, 
will  depend  on  what  the  computer  Industry  thinks  customers  will  want  In  computer  ter- 
minals of  the  1990s.  Projecting  what  people  will  want  In  computer  terminals  of  the 
1990s  would  seem  to  be  an  even  harder  job  than  projecting  technological  possibilities 
for  the  1990s.  Fortunately,  terminals  actually  available  twenty  years  from  now  will 
be  defined  by  R&D  currently  being  pursued.  It  Is  only  necessary  to  observe  what  R&D 
managers  currently  think  people  will  want  In  the  1990s,  and  that  Is  very  likely  what 
they  will  get.  As  an  R&D  manager,  1 am  going  to  present  my  own  views  of  future  desires 
for  computer  terminals,  and  I have  not  tried  to  make  an  opinion  survey  among  my  col- 
leagues . 

I have  been  careful  Co  refer  to  future  desires  of  customers,  and  not  future  needs. 
People  are  not  very  good  at  estimating  the  most  cost  effective  computer  technology  to 
help  them  do  their  jobs.  Supposed  experts  usually  disagree,  and  very  few  scientific 
experiments  have  been  done  relating  characteristics  of  computer  systems  to  Improvements 
In  job  performance.  In  those  few  cases  where  experiments  have  been  performed,  results 
have  been  uniformly  counter-lnCultlve.  Examples  of  results  Include  slow  time  sharing 
systems  are  no  worse  for  problem  solving  chan  fast  time  sharing  systems;  video  mediated 
communications  are  no  better  than  voice  mediated  communications  for  teleconferencing; 
large  vocabularies  for  speech  communication  with  computers  are  no  better  than  small 
vocabularies  for  speech  communication  with  computers;  some  times  data  Is  better  presen- 
ted as  numbers  than  as  graphs;  graphical  presentation  using  symbols  Is  frequently 
better  than  graphical  presentation  using  colors;  less  computer-presented  data  Is  fre- 
quently better  for  decision  making;  and  so  on.  The  Inescapable  conclusion  Is  that 
experimentation  Is  a better  approach  to  designing  systems  than  meditation.  Fortunately 
or  unfortunately.  It  Is  a lot  easier  to  find  out  what  people  want  than  what  they  really 
need,  and  what  they  want  defines  the  market  place. 

Terminal  access  technology  of  the  1990s  will  differ  for  various  jobs  and  workers. 
For  this  paper  1 will  be  exclusively  concerned  with  office  work,  a cerebral  function 
where  brains  are  more  Important  than  brawn,  thinking  more  Important  than  sharp  eyes 
and  sensitive  ears. 

I want  to  describe  several  kinds  of  activities  that  comprise  office  work,  and 
discuss  the  ways  that  computer  technology  In  the  1990s  will  aid  those  activities.  The 
most  Important  activity  In  office  work  Is  thinking.  It  Is  essential  that  computer 
programs  be  developed  to  aid  people  In  thinking  or,  preferably,  think  for  people.  The 
reason  for  my  concern  Is  subtle  but  Important.  I believe  It  Inevitable  that  the  prob- 
lems facing  society  In  the  1990s  will  require  solutions  that  are  beyond  the  Intellec- 
tual capability  of  any  lndlvldual--person  or  computer.  Both  people  and  computers  have 
fundamental  physical  limits  that  they  face.  For  computers  that  limit  Is  determined 
by  the  limited  speed  of  light  (only  one  foot  per  nanosecond)  and  atomic  limits  on  the 
smallest  scale  computing  devices.  I cannot  easily  characterize  the  forces  determining 
the  fundamental  limits  on  human  Intellect,  but  I am  sure  they  exist.  In  any  case. 

If  Important  problems  requiring  solutions  are  beyond  the  capabilities  of  Individuals, 
they  must  be  solved  by  groups.  We  know  a reasonable  amount  about  how  to  combine  com- 
puters together  In  order  to  get  Increasing  computational  power.  Although  all  algo- 
rithms do  not  benefit  from  multi-processor  execution,  an  Increasing  number  can  be 
speeded  up.  In  contrast,  we  know  almost  nothing  about  how  to  combine  people  together 
In  order  to  get  Increased  Intellectual  capability.  A ten-member  committee  Is  not  ten 
times  as  Intelligent  as  any  of  Its  members.  Since  we  have  not  discovered  how  to  com- 
bine the  Intellectual  capabilities  of  people  after  thousands  of  years,  I have  little 
confidence  that  we  will  solve  this  problem  during  the  next  two  decades.  Computer  re- 
lacement  for  human  thinking  Is  being  vigorously  pursued  with  limited  success  In  the 
field  of  artificial  Intelligence.  Computer  aids  for  human  thinking  are  beginning  to 
appear,  ano  Include  modeling  tools  (especially  heuristic  modeling  tools),  decision 
making  tools,  and  programs  to  assist  people  In  plausible  reasoning.  In  summary.  In- 
creasing scale  of  problems  requires  technology  for  upward  scaling  of  computer  power  (as 
a replacement  for  human  thinking)  or  a new  approach  to  upward  scaling  or  human  Intelli- 
gence (one  approach  Is  through  computerized  teleconferencing  systems  that  actively  aid 
group  processes  as  described  below) . 

Conferencing  Is  going  to  be  an  Increasingly  Inportant  function  In  office  work 
during  the  next  two  decades.  The  expertise  of  large  numbers  of  different  kinds  of  spe- 
cialists will  be  required  to  solve  Increasingly  complicated  social  problems.  Further, 
these  experts  will  Inevitably  be  geographically  distributed,  requiring  teleconferencing 
systems.  Computers  are  not  necessary  In  teleconferencing  systems  with  adequate  band- 
width (voice,  video,  and  so  on)  to  mimic  face-to-face  conferences  although  computers 
can  reduce  the  cost  of  high  bandwidth  teleconferences  by  compressing  communications 
using  redundancy  Inherent  In  human  discourse.  Computers  are  necessary,  however,  as 
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part  of  taleconferencing  syateas  that  contain  adequate  procedurea  for  controlling  the 
conference  (chairman,  agenda,  racognltlon  of  rank  of  apeakera , . . . ) and  good  recording 
toola  (treating  the  conference,  and  parta  of  the  conference,  aa  data),  aa  well  aa  In 
providing  apeaker  aide  auch  aa  alectronlc  hlackboarda  and  polntara,  and  electronic  elide 
projectors.  Such  teleconferencing  aystema  will  be  aimed  at  providing  high  fidelity  con- 
ferences that  appear  that  almost  as  If  they  were  taking  place  In  one  room.  I contend 
that  If  high  fidelity  teleconferencing  Is  a goal  of  computer  technology,  we  will  never 
achieve  a solution  to  the  scaling  problem  1 have  dlscusaed--how  to  combine  people  to- 
gether to  get  Increasing  capabllltlas.  Rather,  I advocate  the  development  of  low 
fidelity  teleconferencing,  where  the  communications  stream  Is  purposely  distorted  by 
computers  In  teleconferencing  system  In  order  to  Improve  communication  and  group  problem 
solving.  I want  to  Illustrate  this  concept  with  ten  short  examples. 

(1)  People  write  and  speak  with  a distinctive  style  that  Is  as  Idiosyncratic  as  a 
fingerprint.  It  Is  likely  that  people  can  better  understand  communications  In  their  own 
style  than  In  anyone  else's  style.  Computers  In  a teleconferencing  system  could  trans- 
form communications  from  one  person’s  style  to  snother,  thus  aiding  the  group  decision 
making  process. 

(2)  An  argument  can  be  presented  In  any  number  of  ways,  and  the  rhetoric  selected 
will  depend  on  the  point  to  be  made  and  the  audience  that  Is  listening.  A computer  pro- 
gram could  serve  to  adlvse  speakers  In  a teleconference  on  the  best  form  of  argument  to 
use  In  order  to  achieve  concensus. 
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(3)  Jargon  and  abbreviations  are  an  Invaluable  communications  aid  within  a group, 
but  raise  a significant  barrier  to  communications  between  groups.  An  abbrevlator/ 
deabbrevlator  program  within  a teleconferencing  system  could  speed  up  communication 
between  Individuals  sharing  concepts  In  common  (and  even  create  abbreviations  for 
frequently  used  phrases)  and  could  facilitate  communications  among  strangers. 

(4)  A computerized  diplomat  could  detect  Insults,  snide  remarks  and  profanity  In 
the  communications  during  a conference,  and  filter  them  out  In  order  to  aid  the  group 
problem  solving  process.  This  Is  a function  usually  performed  by  translators  at  Inter- 
national negotiation  sessions. 

(5)  When  people  deceive,  they  reveal  themselves  through  various  clues.  These 
Include  physiological  Indicators,  nonverbal  behavior,  and  changes  In  communications 
style.  Computer  programs,  working  for  lndlv,ldual  members  of  a teleconference,  could 
alert  them  when  they  receive  potentially  deceptive  communications  from  others  at  the 
meeting. 

(6)  Some  people  talk  rapidly  and  others  talk  slowly.  It  Is  reasonable  to  believe 
that  fast  speakers  are  fast  listeners,  and  slow  speakers  are  slow  listeners.  A tele- 
conferencing system  can  serve  like  the  transmission  In  an  automobile  to  speed  up  and 
slow  down  communications  among  different  members  of  a conference. 

(7)  Host  of  the  contents  of  messages  and  briefings  are  Intuitively  obvious,  and 
only  a small  percentage  of  the  communications  are  devoted  to  new,  surprising  facts. 
Insights  or  opinions.  A computerized  counter-intuitive  filter  would  serve  an  Important 
function  In  allowing  the  unpredictable  material  to  reach  a member  of  a teleconference, 
while  relieving  him  of  the  burden  of  listening  to  material  he  would  otherwise  expect 
anyway. 

(8)  Psychologists  have  developed  a number  of  rules  for  managing  groups  Involved 

In  brainstorming.  These  rules  could  Include  suppressing  criticism,  thinking  through 
metaphors,  generating  as  many  Ideas  as  possible,  and  so  on.  Such  rules  could  be  Imposed 

by  a teleconferencing  system  to  aid  the  creative  process. 

(9)  Each  member  of  a decision  making  group  has  his  own  decision  model.  That  model 

could  be  revealed  during  a teleconference  to  aid  concensus  seeking  by  Identifying  areas 

of  significant  differences  of  opinion  and  quickly  eliminating  areas  of  agreement. 

(10)  Beautiful  and  handsome  conference  participants  who  are  well  groomed  are  more 
credible  and  believable  than  their  less  fortunate  colleagues.  A video  based  telecon- 
ferencing system  could  Include  picture  processing  software  that  would  beautify  confer- 
ence participants,  thus  aiding  them  In  stating  their  case. 

Document  production  Is  an  exceedingly  mundane  but  central  function  of  office  work. 
Computerized  editors,  spelling  correctors,  grammarlzers  and  format  neateners  are  Inev- 
itable developments  In  computerized  office  work  of  the  1990s,  Documents  themselves 
will  change,  becoming  multi-media  vehicles  mixing  text,  voice,  printing,  script,  line 
drawings,  and  half-tone  pictures. 

Electronic  mall  Is  a nonreal-tlme  analog  of  teleconferencing,  and  Is  now  available 
In  limited  use.  Nickname  addressing,  forwarding,  forced  routing,  and  Importance  alert- 
ing and  prioritizing  are  here  now.  Functions  such  as  redundancy  filtering  to  eliminate 
unnecessary  repetition  of  messages  will  be  avsllable  during  the  coming  decades. 

Briefings  are  a special  form  of  conference  that  play  an  essential  role  In  office 
work.  The  briefer  nay  be  viewed  as  a guide,  leading  the  audience  through  a data  terri- 
tory with  the  goal  of  convincing  the  audience  to  adopt  a point  of  view.  A good  guide 
modifies  his  route,  adapting  to  the  responses  of  the  audience.  In  the  1990s  this  will 
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be  facilitated  because  audiences  will  be  Instruaentad  to  appraise  speakers  of  their 
reactions  and  briefings  nay  be  autoaatlcally  personalised  to  specific  Individuals  in 
the  audience. 


Library  searching  Is  achieving  increasing  inportancy  In  office  work  as  larger  and 
larger  conputerlzed  data  bases  become  comnonly  available.  In  the  1990s,  the  structure 
and  directory  of  data  bases  will  be  personalized  for  Individuals  and  nade  congruent  with 
human  nenory.  More  Intelligent  data  base  systems  may  be  ezpacted  that  accept  complez, 
short  high  level  questions  from  the  office  worker  and  return  brief,  high  level  answers. 

The  office  workers  that  I am  Interested  In  are  decision  makers  and  their  staffs. 

For  these  Individuals,  the  value  of  their  decisions,  mistakes,  and  opportunities  seized 
or  lost  is  far  greater  than  the  cost  of  the  computer  systems  they  will  use  In  solving 
problems.  The  Importance  of  their  work  coupled  with  the  decreasing  cost  of  electronic 
devices  based  on  Improvements  In  LSI  technology  and  mass  production  assure  that  exceed- 
ingly fancy  terminals  will  be  cost  effective  for  selected  office  workers  In  the  1990s. 

My  approach  to  defining  desires  for  terminal  access  technology  in  the  1990s  Is 
based  on  the  observation  that  computerization  of  office  function  has  caused  big  Improve- 
ments In  a number  of  areas,  but  has  also  lead  to  significant  losses  In  capability  be- 
cause of  some  characteristics  of  computer  terminals.  People  are  now  overcoming  their 
Initial  Infatuation  with  automation.  I believe  that  the  trend  of  the  coming  decades 
will  be  to  regain  loss  capabilities  In  an  electronic  context — If  you  like,  to  reinvent 
the  last  four  thousand  years  of  office  work.  I do  not  maintain  an  anti-technology 
position.  New  advances  always  result  In  some  losses.  Computerization  Is  unique  in 
that  we  can  regain  what  has  been  lost. 

I have  Identified  three  areas  In  which  new  computer  terminals  for  the  1990s  will 
help  us  to  regain  old  office  capabilities.  We  will  regain  some  of  the  advantages  of 
pre-computer  communications,  some  of  the  special  advantages  of  working  with  paper,  and 
some  of  the  advantages  of  the  work  environment  before  computer  terminals. 

THE  REBIRTH  OF  ENRICHED  COMMUNICATIONS 

At  this  time  there  are  very  few  possibilities  for  man-computer  interaction.  The 
technologies  available  Include  typing;  drawing  and  pointing  with  a stylus;  and  color 
alphanumerical  or  graphic  displays.  Simple  trends  of  the  near  future  are  to  Increase 
the  size  of  the  display  and  to  replace  today's  fixed  alphabet  keyboards  with  dynamic 
adaptive  keyboards  presenting  a changing  alphabet  of  possibilities  depending  on  the 
kinds  of  things  people  are  typing  at  any  particular  time. 

Computers  are  getting  to  be  more  like  people.  The  field  of  artificial  Intelligence 
Is  always  ten  years  behind  Its  aspirations  and  twenty  years  behind  its  publicity,  but 
It  Is  making  progress.  As  computers  become  smarter  and  more  like  humans,  people  will 
want  to  regain  the  enriched  communications  possibilities  they  now  enjoy  with  other 
humans  but  not  with  computers.  As  computers  become  smarter,  they  will  be  able  to  make 
good  use  of  new  channels  of  communication. 


What  channels  of  communication  do  people  use.  Verbal,  overt  channels  Include 
writing  and  printing;  speaking;  drawing  and  pointing  with  fingers  as  well  as  a stylus; 
painting;  and  gesturing  and  moving,  as  In  pantomime.  Non-verbal  covert  channels  of 
communication  Include  eye  motion,  eye  expression,  and  pupil  size;  speech  clues  like 
pausing  and  accents;  graphology  of  printing  and  script;  facial  expression,  "type  of 
face",  make-up,  moustaches  and  beards;  blushing;  sweating  and  body  odor;  small  body 
movements;  posture;  muscle  tension  and  shivering;  and  social  Indicators  such  as  dress 
and  haircut.  It  Is  technically  possible  to  communicate  all  of  these  types  of  informa- 
tion to  computers  right  now,  but  we  do  not  know  how  to  write  computer  programs  to  Inter- 
pret and  make  use  of  the  Information. 

In  addition  to  these  modes  of  communication  currently  In  use  between  people  and 
potentially  used,  through  terminals,  between  people  and  computers,  I cannot  resist 
describing  a new  alternative.  Human  brain  waves,  the  electroencephalogram  or  EEC,  Is 
easily  measured  electronically.  It  Is  not  a communication  channel  used  by  people,  but 
It  could  be  used  between  people  and  computers.  It  would  have  the  greatest  communication 
bandwidth  of  any  communication  channel,  the  shortest  latency,  the  most  comprehensive 
code  (Including  messages  otherwise  unavailable) , the  easiest  transmission  (in  energy 
terms),  and  would  raqulro  the  least  training.  Unf ortuantely , the  BEG  has  a terrible 
slgnal-to-nolse  ratio.  Nevertheless,  I have  been  supporting  some  research  to  try  to 
learn  the  EEC  coda  and  usa  the  EEC  as  a communications  channel  between  men  and  com- 
puters. We  have  made  surprising  progress  and  are  now  able  to  get  useful  information 
from  the  EEC  about  people's  attention  (how  much,  to  what),  fatigue,  processing  load 
and  reserve,  thoughts  of  apeclflc  words,  connotation  In  communication,  and  surprise,  to 
give  a few  examples.  The  technology  for  doing  this  Is  easy,  and  the  algorithms  for  In- 
terpreting the  EEC  have  been  drawn  from  everyday  statltlstlcs . EEC  can  be  measured  from 
electrodes  Imbedded  in  the  headband  of  hi-fi  speakers  and  magnetic  brain  waves  can  be 
measured  without  any  alactrodss  by  placing  an  appropriate  sensor  near  the  head.  This 
raises  the  specter  of  covert  mind  reading,  but  I assure  you  that  Is  not  a real  danger. 
Magnetic  particles  in  the  hair  and  clothing  produce  much  larger  fields  than  the  brain, 
and  In  order  to  pick  up  magnetic  brain  waves  it  Is  necessary  to  demagnetize  the  head 
and  bacome  magnetically  naked. 


In  ■unary,  I believe  terminal  acceaa  technology  of  the  1990a  will  make  uae  of  a 
rich  aaaortment  of  verbal  and  nonberbal  channela  of  communication  ao  that  man-computer 
communication  will  be  aa  rich  aa  man-man  communication  la  now. 

REGAINIHG  THE  ADVANTAGES  OF  PAPER 

Paper  la  a marveloua  medium  for  office,  work,  and  we  have  loot  a number  of  Ita 
advantagea  with  the  advent  of  CRT  termlnala  or  even  with  the  uae  of  printing  terminals. 
Before  computer  termlnala  were  uaed  for  office  work,  we  purpoaely  or  accidentally  used 
many  capabilities  of  paper.  Fortunately,  these  can  be  regained  through  properly 
designed  terminals  of  the  1990s. 

It  Is  easy  to  Include  Important  types  of  Information  on  paper,  such  as  annotations 
and  logos.  These  ace  usually  lost  In  computerized  documents. 

In  addition  to  overt  means  of  representing  Information,  paper  documents  contain  a 
number  of  Important  covert  clues.  These  Include  the  shape  of  text;  the  color  of  text 
and  paper;  the  type  of  style  of  annotation.  Including  writing  e.g.  printing  or  script 
containing  their  own  clues,  rubber  stamps,  stickers  and  seals;  water  marks;  additions 
to  text  showing  temporal  Information;  paper  documents  as  an  original  or  a copy;  the 
type  of  binding,  such  as  paper  clips,  staples,  glue,  or  sewing;  perfumed  paper;  the 
weight  and  texture  of  paper;  or  the  existence  of  covers  or  end  papers.  This  list  does 
not  exhaust  the  clues  In  paper  documents,  but  It  gives  some  Indication  of  their  wealth 
and  variety. 

Paper  can  be  manipulated  In  a number  of  ways.  It  can  be  placed  In  a particular 
location,  stacked,  bound,  cut  and  pasted.  Stacks  can  be  flipped  or  looked  at  end-on. 

These  manipulations  that  are  possible  with  paper  allow  people  to  retrieve  paper 
documents  using  geography  as  well  as  symbols.  I locate  documents  by  where  they  are  In 
my  office — In  specific  piles  on  particular  places  In  my  bookshelves,  or  In  particular 
locations  In  my  files.  The  ability  to  manipulate  paper  allows  me  to  get  close  to  the 
document  of  choice,  and  the  use  of  paper  clues  allow  me  to  zero  In  on  the  particular 
document  I am  trying  to  find.  This  geometric  or  geographic  approach  to  paper  document 
management  utilizes  human  place  memory.  Human  place  memory  Is  very  good.  Indeed.  One 
standard  technique  to  teach  people  to  have  better  memory  Is  to  suggest  that  they  Imagine 
a room,  and  place  things  that  they  want  to  remember  at  different  places  In  the  room. 

From  a Darwinian  evolutionary  viewpoint  It  Is  easy  to  see  why  human  place  memory  might 
be  considerably  better  than  human  symbolic  memory. 

Fortunately,  computer  technology  makes  It  possible  for  us  to  have  the  best  of  both 
worlds.  It  Is  easy  to  Imagine  terminals  that  would  allow  us  to  enjoy  the  advantages  of 
paper  as  well  as  the  advantages  of  electronic  storage.  I want  to  describe  a mythical 
but  technically  feasible  data  management  system  based  on  human  place  memory.  It  is  a 
data  management  system  for  storing  text  and  facts  geographically  as  well  as  symbolical- 
ly. I will  say  nothing  more  about  the  symbolic  aspects  of  the  storage  system,  since 
they  are  thoroughly  conventional.  With  the  geometric  version  of  the  system.  Information 
Is  stored  at  particular  places  in  a two-dimensional  or  three-dimensional  space  simulated 
by  the  computer  storing  the  Information.  The  screen  of  the  user's  terminal  Is  concld- 
ered  a window  on  that  space  and  the  window  can  move  through  space,  controlled  by  move- 
ments of  the  hands,  eyes,  and  body  of  the  user.  Movements  Include  translation,  zooming, 
and  rotation.  Navigation  can  be  accomplished  in  a variety  of  ways  based  on  a grid 
gradient  throughout  the  space,  a color  gradient  throughout  Che  space,  colored  regions, 
and  various  landmarks  and  roads.  Information  at  various  places  can  be  stacked  Just 
like  documents,  and  can  be  given  all  of  the  previously  described  paper  clues  that  are 
basically  visual.  With  such  as  system  it  would  be  possible  for  the  user  to  enter  data, 
retrieve  data,  move  data,  flip  through  data,  search  through  a pile  of  data,  and  so  on. 
Data  In  the  space  could  be  reorganized  to  take  on  a personalized  view,  and  yet  maintain 
symbolic  links  to  facilitate  sharing  and  communication  among  users.  This  system  Is 
within  the  current  state-of-the-art  and  is  under  construction.  Terminals  of  the  1990s 
will  have  to  support  the  facilities  demanded  of  such  geographic  data  management  systems 
in  order  to  regain  a number  of  the  advantages  of  paper. 

Regaining  Che  advantages  of  Che  paper  will  increase  communication  bandwidth  on 
particular  subjects:  ease  document  retrieval;  ease  document  construction;  Improve 

authentication  of  the  source  of  text;  and  provide  new  communications  about  the  sender, 
the  sender's  view  of  the  subject;  and  the  sender's  view  of  the  recipient. 

REVIEWING  THE  WORK  ENVIRONMENT 

With  current  computer  terminals,  office  work  Is  done  a^  the  computer  terminal. 

The  restricted  computer  terminal  work  environment  Is  very  dlferent  than  pre-computeriza- 
tion work  environments,  and  distinctly  worse.  The  older  non-eleccronlc  alternative  is 
to  have  specific  work  places  considerably  larger  than  a terminal;  and  to  be  able  to  work 
anywhere.  Work  places  would  Include  a desk,  filing  cabinet,  tables,  blackboard,  bulle- 
tin board,  conference  table,  and  so  on.  The  ability  to  work  anywhere  stems  from  the 
portability  of  pencils,  paper,  books,  and  so  on. 

The  concept  of  specific  work  places  may  appear  to  be  In  conflict  with  the  idea  of 
working  anywhere.  That  is  the  case,  and  yet  both  possibilities  have  advantages.  The 
work  place  carries  with  it  the  connotation  of  the  thinking  cap--an  environment  for 
accomplishment;  and  yet  has  a richness  not  shared  by  conventional  computer  terminals. 


The  possibility  of  portable  vork  Is  also  highly  desirable,  and  makes  It  possible  to 
grasp  targets  of  opportunity,  such  as  a pleaaant  day  In  the  park.  Terminal  access 
technology  of  the  1990s  will  have  two  incarnations:  first,  as  an  electronic  place  for 
doing  work,  far  exceeding  current  terminals  In  richness  of  interaction;  second,  as 
portable  terminals  that  are  always  available  to  the  user  for  work  In  any  location.  The 
electronic  place  may  be  thought  of  as  a terminal  that  Is  a room--a  multichannel,  multi- 
model terminal  that  knows  the  location  and  attention  of  the  user,  and  Is  an  active 
partner  with  the  user  in  solving  problems.  The  major  functional,  rather  than  emotional, 
advantage  of  the  electronic  place  Is  in  the  explicit  use  of  redundancy.  Current 
computer  systems.  Including  computer  terminals,  avoid  redundancy  like  the  plague. 
However,  it  Is  redundancy  In  the  environment  that  makes  It  possible  for  Imperfect  human 
beings  to  function.  An  environment  without  redundancy,  such  as  those  constructed  with 
modern  computers,  place  a heavy  burden  on  people  and  Inevitably  lead  to  errors. 

The  construction  of  a portable  computer  terminal  will  not  be  a significant 
challenge  by  the  1990s,  due  to  new  batteries,  low  power  flat  displays,  ultra-low  power 
LSI  components,  and  a variety  of  I/O  means  as  previously  described.  Communications  may 
be  provided  by  packet  radio  or  by  other  means.  Although  the  conventional  Idea  of  a 
portable  terminal  Is  the  most  likely  to  be  desired  and  hence  to  occur,  more  exotic  pos- 
sibilities are  conceivable.  Implantable  terminals  will  be  technically  feasible,  but 
probably  feared.  However,  almost  as  good,  terminals  may  be  Imbedded  In  clothing.  Elec- 
tronic clothing  could  contain  a distribution  of  components  connected  by  an  optical 
packet-switched  bus  and  might  be  powered  thermoelectrlcally , using  threads  of  dissimilar 
metals  sewn  Into  the  fabric.  With  such  a suit,  a computer  user  of  the  1990s  would  have 
his  terminal  with  him  everywhere,  or  at  least  everywhere  he  would  want  It. 

SUMMARY 

Regaining  the  advantages  of  human  communication,  paper  as  an  office  medium,  and 
Intensive  and  accessible  work  environments  will  be  a high  priority  for  future  purchasers 
of  computer  service,  and  is  currently  a high  priority  for  the  RAD  of  computer  terminals 
of  the  1990s. 
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SUMMARY 

A review  is  made  of  cost/performance  trends  in  computing  technology,  including  hardware 
elements  and  software  development.  The  resulting  impact  on  computer  systems  architecture  i 

is  briefly  covered.  Particular  emphasis  is  placed  on  the  trend  toward  decentralization.  | 

The  evolution  of  the  development  of  distributed  computing  and  the  use  of  modern  dedicated  j 

small  computers  is  examined  and  analyzed.  I 

An  explanation  is  given  of  Grosch's  Law,  which  has  described  the  relationship  of  comput- 
ing power  and  computer  price.  The  possibility  is  explored  that  recent  developments  may 
invalidate  the  conclusion  based  on  Grosch's  Law  (hitherto  commonly  accepted)  that  any 
enterprise  should  get  its  data  processing  done  on  the  largest  possible  computer  that  the 
enterprise  can  afford  to  acquire,  or  to  which  it  can  afford  to  buy  access.  Conclusions 
are  drawn  that  a radical  revolution  In  the  use  of  computers  has  begun,  and  that  in  the 
future,  a new  "Principle  of  Decentralized  Computing*  may  replace  Grosch's  Law. 


INTRODUCTION 

A while  ago,  as  I was  skimming  through  the  Wall  Street  Journal,  I was  approached  by  a 
philosopher  friend  of  mine.  I commented  on  the  interesting  item  I had  been  reading  about 
the  remarkable  growth  of  the  consumer  power  tool  industry.  It  seems  that  astonishing 
records  are  being  set  for  the  sale  to  home-owners  of  one-quarter  inch  electric  drills. 

I expressed  my  surprise  that  there  were  so  many  people  who  wanted  one-quarter  inch  drills. 
My  friend  observed,  "I  don't  believe  there  is  any  real  demand  for  them."  When  I asked 
how  he  could  come  to  such  a conclusion  in  the  face  oi  the  evidence,  he  pointed  out  that, 
"People  do  not  necessarily  want  one-quarter  inch  drills.  The  things  they  really  want 
are  a great  many  one-quarter  inch  holes. " 

This  paper  will  attempt  to  look  into  the  future  of  computing  from  such  a point  of  view. 
Computer  professionals,  in  their  saner  moments,  will  acknowledge  that  computers  are  only 
tools  to  en2Uale  us  all  to  achieve  our  real  objectives.  But  unfortunately,  computer  pro- 
fessionals usually  talk  and  act  as  though  the  computer  is  an  end  in  itself.  However, 
the  realization  is  slowly  coming  into  focus  that  we  don't  really  want  one-quarter  inch 
drills;  we  just  want  a lot  of  one-quarter  inch  holes. 

Fortunately,  technological  improvements  which  are  driving  down  the  cost  of  computing  will 
make  it  possible  for  users  to  approach  the  subject  a little  more  rationally  than  we  have 
in  the  last  twenty  years.  Therefore,  I will  first  review  those  technological  improvements 
and  give  you  a glimpse  of  the  cost-performance  trends  for  hardware  and  for  software.  Then 
I will  show  you  the  impact  that  these  trends  are  having,  and  apparently  will  have,  on  the 
architecture  of  information  processing  systems.  I will  place  particular  emphasis  on  the 
trend  toward  decentralization.  I will  trace  the  evolution  of  the  use  of  dedicated  small 
computers  and  of  the  development  of  distributed  computing.  Next,  I will  explain  Grosch's 
Law,  which  has  described  the  relationship  of  computing  power  and  computing  price.  The 
possibility  will  be  explored  that  recent  developments  may  invalidate  the  conclusion, 
based  on  Grosch's  Law  (and  thus,  commonly  accepted)  that  any  enterprise  should  get  its 
data  processing  done  on  the  largest  possible  computer  that  the  enterprise  can  afford  to 
acquire,  or  to  which  it  can  afford  to  buy  access.  Finally,  I will  draw  the  conclusion 
that  a radical  revolution  in  the  use  of  computers  has  begun,  and  that  in  the  future,  a 
new  "Principle  of  Decentralized  Computing”  should  replace  Grosch's  Law. 

COST- PERFORMANCE  TRENDS  IN  COMPUTING 

GENERAL 

Almost  any  one  of  the  references  to  this  paper  will  contain  the  opinion  that  the  costs 
of  all  elements  of  computing  are  declining,  some  more  rapidly  than  others,  and  that,  at 
the  same  time,  performance  is  increasing.  From  such  generalities,  many  opinions  are 
formed,  and  consequences  deduced.  However,  if  you  ask  yourself  the  question,  "How  much 
has  cost  declined  of  any  particular  element  of  the  cost  of  doing  computing?"  it  becomes 
a very  difficult  question  to  answer,  and  thus,  one  begins  to  doubt  the  validity  of  some 
of  the  conclusions.  I am  a great  believer  in  learning  from  the  thoughts  of  men  of 
demonstrated  Intellectual  capacity.  One  of  my  favorite  quotations  is  from  the  great 
British  physicist  of  the  nineteenth  century,  William  Thompson,  Lord  Kelvin; 

"When  you  can  measure  what  you  are  speaking  about,  and  express  it  in  numbers, 
you  know  something  about  it;  but  when  you  cannot  measure  it,  when  you  cannot 
express  it  in  numbers,  your  knowledge  is  of  a meager  and  unsatisfactory  kind; 
it  may  be  the  beginning  of  knowledge,  but  you  have  scarcely,  in  your  thoughts, 
advanced  to  the  stage  of  science.” 
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Consequently,  I made  an  effort  to  quantify  these  cost  trends. 

I immediately  ran  into  several  problems  of  reconciling  the  available  data.  No  authors 
approached  the  matter  from  the  same  point  of  view.  None  of  the  performance  data  was 
expressed  in  comparable  units.  Some  authors  concentrated  primarily  on  the  past, 

(Ref.  1, 2, 3, 4, 5, 6, 7, 8) ; others,  primarily  projected  into  the  future  (Ref.  9,10,11,12,13). 
Very  few  authors  defined  adequately  the  makeup  of  the  sub-system  they  were  describing. 

For  example,  is  a CPU  (Central  Processing  Unit)  only  the  processing  logic?  Does  it 
include  high-speed  internal  storage?  Does  it  include  its  power  supplies?  In  many  cases, 
the  meaning  of  "cost”  was  not  defined.  Was  it  the  cost  of  manufacturing,  or  the  ultisMte 
price  to  the  consumer?  Since  the  latter  is  more  meaningful  in  the  context  of  this  paper, 

I have  adopted  as  my  standard. 

But  that  brought  me  right  up  against  another  problem.  Withington,  in  Ref.  10,  makes  it 
clear  that,  after  the  end  of  any  decade,  you  simply  cannot  buy  a thing  directly  comparable 
to  the  thing  you  bought  at  the  beginning  of  the  decade.  For  example,  useful  computer 
systems  with  only  2K  of  Internal  high-speed  storage  were  normal  in  1955,  but  by  1965 
they  were  not  generally  offered  for  sale.  This  situation  gets  completely  unmanageable 
when  one  looks  at  software  costs.  A very  acceptable  software  progr2un  in  1965  would  t>e 
totally  unacceptable  in  1975.  The  1965  model  would  have  been  developed  without  adequate 
specifications;  it  would  be  prohibitively  hard  to  maintain;  its  documentation  would  be 
rudimentary  and  inadequate;  its  testing  would  not  have  been  subject  to  modern  standards 
of  quality  assurance,  etc.,  etc.  In  spite  of  such  obstacles,  I persisted  in  my  efforts 
at  synthesizing  the  data.  The  brilliant  mathematical  logician,  Alfred  North  Whitehead, 
says:  "The  art  of  progress  is  to  preserve  order  amid  change,  and  to  preserve  change 

amid  order."  In  humble  emulation  of  Whitehead  and  Kelvin,  I synthesized,  in  Figure  1, 
all  the  data  available  to  me.  In  all  cases,  I show  the  cost  to  the  user  for  an  element 
of  a computing  system  relative  to  its  cost  in  1960.  I believe  that,  with  a few  exceptions 
noted  below,  I have  ’tdequately  reconciled  most  of  the  anomalies. 

HARDWARE 

I have  selected  for  study  three  of  the  physical  elements  that  go  into  a computing  system: 
(1)  Processing  and  Internal  Storage  (to  represent  the  cost  of  raw  computing  power) , (2) 
Fast  Access  Mass  Storage  (to  represent  the  cost  of  having  all  of  the  data  required  rapidly 
available  for  processing)  and  (3)  Common  Carrier  Communication  Lines  (to  represent  the 
costs  of  using  a computer  from  a remote  location) . 

To  study  "Processing  and  Internal  Storage"  I have  used  the  cost  of  what  is  commonly  called 
a CPU  (Central  Processing  Unit) . It  Includes  all  the  logic  necessary  to  interpret  and 
execute  the  instructions  to  the  computer,  and  the  high-speed  internal  storage  in  which 
resides  the  program  and  the  data  currently  being  processed.  It  also  includes  such  appur- 
tenances as  the  power  supply  and  the  cabinet.  The  data  was  obtained  primarily  from  Ref. 
5,10,  and  Auerbach's  paper  in  Ref.  14.  Costs  for  such  CPU's  from  1965  on,  typically 
represent  the  cost  of  a system  with  much  more  powerful  logic,  and  a much  greater  quantity 
of  internal  storage  than  1960  CPU's.  Hence,  the  costs  shown,  particularly  for  1975 
through  1985,  are  actually  for  CPU's  much  more  powerful  than  the  comparable  ones  of  1960. 
In  spite  of  this  defect  in  strict  comparability,  the  index  of  costs  shows  a spectacular 
improvement  through  the  years.  There  is  nothing  startling  about  this  conclusion  — hardly 
anyone  would  challenge  it.  The  data  are  presented  for  comparison  with  the  other  elements 
of  a computing  system.  1975  CPU's  cost  approximately  one-half  of  1%  of  those  of  1960. 

The  1985  systems  will  cost  about  20%  of  the  1975  systems.  Clearly,  in  any  analysis  of 
computing  costs  in  the  next  decade,  the  cost  of  the  central  processor  and  its  internal 
storage  has  descended  into  the  noise  level. 

When  one  turns  to  the  relative  costs  of  Fast  Access  Mass  Storage,  there  appears  a picture 
not  so  widely  appreciated.  "Fast  Access  Mass  Storage"  means  the  external  on-line  storage 
of  the  computer,  in  which  is  held  the  data  (and  progreuns)  that  could  not  be  held  in 
internal  storage,  but  which  requires  rapid  access  for  transfer  to  internal  storage  for 
processing.  (I  do  not  consider  here  the  more  esoteric  super-size  storage  with  relatively 
slow  access,  currently  represented  by  such  devices  as  the  Ampex  Terra-bit  and  IBM  3850.) 
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Prior  to  1960,  the  most  widely  used  device  for  this  purpose  was  magnetic  tape,  inadequately 
supplemented  by  small  capacity  magnetic  drums.  Beginning  in  1960,  rotating  magnetic  disks 
became  more  and  more  prominent,  so  that  eventually,  magnetic  tapes  were  largely  relegated 
to  the  role  of  off-line  and/or  archival  storage.  In  the  1960 's,  it  is  expected  that  many 
of  these  devices  will  Include  semi-conductor  technology,  CCD  (Charge-Coupled  Devices, 

Ref.  11,15)  and  magnetic  domain  devices  such  as  bubble  memories  (Ref.  16).  The  data  were 
obtained  primarily  from  Ref.  6 and  10.  In  1960,  the  costs  were  based  upon  the  original 
1956  IBM  RAMAC,  the  1960  IBM  1301,  and  the  1962  IBM  2311,  the  first  major  removable  disk 
pack.  The  point  plotted  for  1974  represents  the  current  200  megabite  IBM  3330-11.  In 
order  to  bring  these,  and  the  future  more  exotic  devices,  onto  a common  ground,  it  was 
necessary  to  "normalize"  the  difference  in  size  of  storage  and  speed  of  access.  For  this 
purpose,  I used  a "Unit  of  Capability,"  defined  as  the  storage  capacity  in  kilo-bites 
divided  by  the  access  time  in  mi lie -seconds.  Comparable  costs,  then,  are  costs  per  unit 
of  capability.  Most  of  the  conventional  wisdom  that  I have  come  across  has  agreed  that 
the  cost  of  such  fast  access  mass  storage  has  come  down  "but  not  very  much.”  My  data 
show,  on  Figure  1,  that  the  cost  Indeed  has  come  down  sharply;  1975  costs  are  only  about 
2%  of  1960  costs,  and  1985  costs  will  be  about  10%  of  1975  costa. 

Since  the  faunous  "marriage  of  computers  and  communications”  (though  heralded  by  Bauer  for 
some  years  before,  the  term  first  appeared  in  print  in  Ref.  17) , an  Increasing  fraction 


of  the  cost  of  getting  computing  done  has  been  the  cost  of  Communications.  Cost  of 
"Communications"  would  include  the  remote  terminal  devices,  the  modems,  multiplexors, 
and  concentrators  (some  of  which  are  on  the  end  of  every  digital  communication  line) 
and  the  lines  themselves.  The  cost  of  the  lines  themselves  represent  a substantial 
portion  of  such  costs.  The  most  glamorous  portion  of  line  costs  are  the  "long  lines" 
from  city  to  city,  now  frequently  mechanized  via  satellite.  However,  what  is  not  com- 
monly recognized  is  that  the  local  connections  (particularly  in  situations  with  multiple 
dispersed  terminals  where  dial-up  services  are  inadequate)  frequently  represent  the 
largest  portion  of  the  cost.  Line  costs  to  the  user  are  not  necessarily  susceptible 
to  advances  in  technology.  Typically,  the  service  must  be  obtained  from  a regulated 
ccxmnon  carrier,  whose  prices  are  set  by  public  authorities  on  the  basis  of  return  on 
investment.  High  density  routes  (where  unit  costs  to  the  carrier  are  small)  frequently 
command  high  prices  in  order  to  subsidize  low  density  connections  to  remote  hamlets, 
where  the  unit  costs  to  the  carrier  are  very  large.  Thus,  the  price  to  the  consigner  is 
normally  not  a function  of  technology. 

My  cost  data  comes  primarily  from  Ref.  3 and  10.  On  Figure  1 we  can  see  that  the  reduc- 
tion of  the  costs  of  the  communication  lines  is  small,  compared  with  the  vastly  reduced 
costs  of  the  other  hardware  elements  of  computing.  1975  costs  are  only  61*  of  those  in 
1960;  1985  costs  will  be  about  53*  of  1975 's.  The  cost  of  other  elements  of  the 
networ)c  will,  of  course,  decrease  in  direct  proportion  to  the  amount  of  electronics  that 
they  contain.  Nevertheless,  it  is  clear  that,  of  all  the  elements  of  computing  cost,  the 
least  decrease  is  represented  by  the  communications  portion. 

SOFTWARE 

When  one  approaches  a study  of  the  costs  of  software,  one  is  keenly  aware  of  Kelvin's 
dictum,  "...when  you  cannot  express  it  in  numbers,  your  knowledge  is  of  a meager  and 
unsatisfactory  kind.."  Professor  Brooks  has  illuminated  the  difficulty  of  the  problem 
in  his  landmark  paper,  "The  Mythical  Man  Month"  (Ref.  18).  Nevertheless,  a few  daring 
souls  have  attempted  to  quantify  the  problem,  principally  a team  of  dedicated  researchers 
at  the  System  Development  Corporation.  An  easily  accessible  summary  of  their  results  is 
presented  by  Boehm  in  Ref.  2.  It  shows  that  there  is  a spread  of  1000*  between  the  10th 
and  90th  percentiles  of  software  productivity,  if  it  is  measured  by  BAL  (Basic  Assembly 
Language)  instructions  per  man  month.  If  we  take  the  median  of  these  data,  we  can 
tabulate  it  thus: 

Year  1955  1960  1965  1970  1975  1980  1985 

BAL  Instructions  Per  Man  Month  200  300  450  700  1100  1650  2500 

A large  part  of  such  improvement  in  productivity  comes  from  the  availeUsility  of  improved 
implementation  systems  — the  tools  for  building  software.  In  1955  only  primitive 
assemblers  were  available.  Later  the  first  generation  of  higher  order  languages  appeared; 
FORTRAN,  COBOL  and  JOVIAL.  Today  for  business  application  programs,  such  sophisticated 
implementation  systems  as  Informatics  MARK  IV  and  IBM's  RPG  II  enable  programmers  to  >- 
produce  a high  quality  equivalent  of  BAL  instructions  from  five  to  fifty  times  faster 
than  the  tools  of  1960  permitted.  Similar  trends  are  noted  in  the  construction  of  system 
programs,  where  the  pioneers,  SDC's  JOVIAL  and  Digitec's  POPS,  have  been  followed  by 
modern  proprietary  systems,  such  as  IBM's  PL/S,  Softek's  AED,  CAP's  CPL/1. 

Surprisingly,  all  this  productivity  is  preserved  when  converted  into  cost  test  per 
instruction.  The  cost  per  man  month  of  a programmer  (and  his  computer  test  time)  capable 
of  producing  such  an  instruction  is  not  growing.  It  is  almost  a constant  in  the  range  of 
$3500  - $3800  per  man  month  total  cost.  Direct  labor  cost,  although  growing,  has  not 
kept  pace  with  inflation,  as  shown  by  Frank  in  Ref.  3.  The  increases  are  compensated  for 
by  a big  decrease  in  overhead  costs.  There  has  also  been  a very  significant  reduction  in 
the  costs  of  machine  time  used  each  month  by  a progreunmer  for  testing. 

If  the  above  data  on  instructions  per  man  month  are  divided  by  the  cost  per  man  month, 
we  obtain  a figure  for  cost  per  BAL  instruction.  Relative  values  thereof  are  plotted  in 
Figure  1 as  the  curve  labelled  "Custom  Software  Development. " The  surprising  result  is 
that  the  cost  of  an  equivalent  program,  comprising  10,000  instructions  today  is  about 
28*  of  its  cost  in  1960  — and  in  1985,  it  will  be  less  than  half  of  its  cost  today. 

But  I must  repeat,  there  is  no  such  thing  as  an  "equivalent"  program  of  10,000  BAL 
instructions.  Today  we  demand  much  more;  for  example,  data  security,  "fail-soft"  char- 
acteristics, automatic  recovery  — things  which  were  rarely  heard  of  in  1960.  Conse- 
quently, it  is  clear  that  the  cost  of  developing  software  has  not  come  down  anything 
like  the  amount  that  hardware  cost  has  decreased  — indeed,  its  cost  reduction  is  almost 
as  disappointing  as  communication  costs. 

However,  another  development  has  taken  place,  t>eginnlng  in  about  1965  (Ref.  3,19,  and  21). 
In  1960  the  concept  of  a "Software  Product"  was  the  dream  of  a few  pioneers.  In  Ref.  20, 
Postley  described  the  concept  (and  the  program  which  was  the  prototype  of  Informatics 
HARK  IV,  the  most  successful  software  product  from  any  supplier  except  IBM) . The  concept 
was  simple.  Do  the  development  once,  but  do  it  in  such  a way  that  the  identical  software 
could  be  used  by  many  different  organizations.  Of  course,  this  has  always  been  done  for 
basic  system  programs  such  as  language  processors,  control  programs,  and  access  methods. 
But  in  1960,  it  was  deemed  Impractical  to  do  so  for  application  programs  (which  represent 
the  major  part  of  software  development  costs).  Stimulated  by  the  success  of  MARK  IV,  such 
extension  to  application  progreuns  began  in  the  late  1960's,  and  application  software 
products  are  commonplace  today. 


Th«re  is  no  reliable  published  data  on  the  economics  of  using  software  products,  and  such 
data  would  vary  widely,  depending  on  the  type  of  product  used.  I have  estimated,  however, 
that  the  equivalent  costs  of  using  a software  product  had,  by  1970,  declined  to  25%  of 
the  cost  of  a custom  developed  equivalent  system,  and  by  1975,  were  certainly  no  greater 
than  15%.  In  some  cases,  the  ratio  could  be  as  little  as  1%.  In  Figure  1,  I have  plotted 
the  most  conservative  curve,  which  shows  that,  using  software  products,  1975  costs  are 
about  6%  of  software  costs  in  1960,  and  that  1985  costs  will  be  about  33%  of  1975' s. 

SUMMARY  OF  COST  PERFORMANCE  TRENDS 


Dramatic  reductions  began  in  1960  in  cost  performance  trends  for  the  elements  of  computing 
systems.  Major  cost  reductions  in  computers  themselves  have  been  accomplished  by  1975^ 
and  there  will  be  continuing  reductions  through  1985.  Communication  costs,  on  the  other 
hand,  lag  far  behind  all  other  types  of  hardware  costs.  The  cost  of  software  development 
has  been  progressing  nicely,  but  dr2unatic  Improvements  can  only  be  achieved  by  using 
software  products.  The  past  and  future  Improvements  are: 

1975  Costs  as  a Fraction  1985  Costs  as  a Fraction 
of  1960  Costs of  1975  Costs 


Hardware 


Processing  and  Internal  Storage 

0.005 

0.20 

Fast  Access  Mass  Storage 

0.02 

0.10 

Common  Carrier  Communication  Lines 

0.61 

0.53 

Software 

Custom  Development 

0.28 

0.47 

Software  Products 

0.06 

0.33 

One  conclusion  that  can  be  drawn  from  this  is  represented  by  Figure  2 , which  shows  that 
hardware  costs  represent  an  ever  decreasing  percentage  of  total  budgets  for  computing.  A 
curve  of  this  sort  has  been  used  many  times  in  the  past  — I first  saw  it  at  Informatics 
in  1963  in  an  internal  planning  document  prepared  by  W.  Frank.  The  version  shown  here 
was  used  in  Ref.  2 to  show  the  computing  budget  of  the  U.S.  Air  Force.  Boehm  does  not 
describe  therein  whether  communication  costs  are  included.  It  is  my  belief  that  they 
are  not. 

IMPACT  ON  THE  SYSTEM  ARCHITECTURE 

OF  INFORMATION  PROCESSING  SYSTEMS 

GENERAL 

"System"  is  a word  which  can  have  many  meanings.  If  all  you  know  is  how  to  make  screw- 
threads,  then  an  assembly  of  a nut  and  a bolt  is  a "system."  So  one  satirical  definition 
is:  "A  system  is  an  interacting  combination  of  functional  parts  at  least  one  level  more 

complex  than  the  user  of  the  word  'system*  can  understand." 

I eun  using  the  word  in  that  way:  An  information  processing  system  is  the  entire  assemb- 

lage of  data  processing  hardware,  software,  procedures,  and  people,  %diich  are  put  together 
to  accomplish  some  useful  objective.  "Architecture"  is  the  configuration  of  the  elements 
(with  specified  characteristics)  of  the  system.  The  impact  of  improvements  in  technology 
on  such  systems  occurs  rapidly  in  the  case  of  systems  both  small; and  simple,  and  takes 
a long  time  in  the  case  of  systems  both  large  and  complex. 

For  example,  consider  a small  simple  system:  Its  purpose  is  data  collection  and  data 

reduction  in  an  experimental  laJsoratory.  Ten  years  ago,  the  system  architecture  would 
very  likely  have  included  analog  instrumentation  whose  output  was  passed  through  expensive 
converters  and  recorded  on  a digital  tape.  After  each  set  of  measurements,  the  tape  reel 
was  sent  to  a central  data  processing  facility  for  data  reduction.  Printed  results  of 
the  experiment  were  delivered  to  the  scientist  a day  or  two  later  for  plotting,  study, 
and  evaluation.  Depending  on  the  results,  the  measurements  might  be  re-recorded  or  the 
experiment  reoriented.  Improvements  in  computer  technology  have  had  a rapid  impact  on 
such  a system.  Today,  the  transducers  would  very  likely  have  digital  output  read  directly 
into  an  inexpensive  minicomputer.  The  procedure  would  be  to  have  the  minicomputer,  on 
the  spot,  do  the  data  reduction  and  rudimentary  analysis,  displaying  the  results  on  a CRT 
terminal,  and  printing  them  on  a typewriter.  Such  a "quick  look"  capability  enables  the 
experimenter  to  evaluate  his  results  immediately,  avoid  many  blind  alleys,  and  thus,  be 
much  more  productive.  His  total  data  processing  hardware,  the  minicomputer  and  its 
terminals,  costs  no  more  today  than  his  simpler  data  collection  sub-systems  did  ten  years 
ago. 

In  the  case  of  a much  larger,  more  complex  system,  the  Impact  of  technological  improvements 
on  system  architecture  takes  a much  longer  time.  For  example,  consider  the  business  data 
processing  operations  of  a very  large  company.  Ten  years  ago,  input  data  was  keypunched, 
and  then  processed  at  a central  site  by  the  largest  available  commercial  computers,  pri- 
marily in  batch  mode.  Printed  reports  were  distributed  days  later.  Today,  the  basic 
system  is  not  greatly  different.  Although  on-line,  transaction-oriented  procedures  are 
beginning  to  be  used,  the  vast  majority  of  today's  business  processing  is  done  much  as 
it  was  ten  years  ago,  except  that  remote  terminals  may  read  the  cards  and  print  the 
reports. 


TREND  TOWARD  DECENTRALIZATION 


Nevertheless,  new  trends  are  beginning  to  develop.  As  might  be  expected,  those  sub-system 
components  which  have  had  the  greatest  increase  in  cost  effectiveness  have  had  the  greatest 
influence  on  architecture.  The  dramatic  reduction  of  cost  in  processing  logic  has  created 
a trend  toward  incorporating  "intelligence,"  (i.e.,  processing  capability)  into  the  remote 
terminals.  There  is  a small  but  definite  growth  in  such  intelligent  terminals,  which  are 
really  minicomputers.  More  and  more,  the  processing  which  can  be  done  remotely  is  done 
remotely,  and  only  that  which  requires  a central  computer  is  transmitted  to  and  Trom  the 
central  site.  This  concept,  known  as  "distributed  processing”  is  rapidly  gaining  in 
popularity.  (Ref.  22,23,24,25,26,27,  and,  especially,  28). 

FREE-STANDING  MINICOMPUTERS 

Ten  years  ago,  if  a small  sized  organization  needed  data  processing,  it  could  not  afford 
to  have  control  of  its  own  data  processing  system.  If  it  were  an  independent  company, 
its  only  choice  was  to  have  the  work  done  by  a vendor  of  data  processing  services. 
Typically,  this  was  a batch  service  bureau,  which  processed  the  work  for  a large  number 
of  small  companies. 

If  the  small  unit  was  part  of  a large  corporation,  it  had  no  choice  but  to  send  its  work 
to  the  organization's  central  data  processing  facility  (Ref.  8).  The  availability  of 
small  minicomputers  for  business  data  processing  has  made  possible  an  entirely  different 
system  for  getting  such  work  done.  (Ref.  29).  It  is  most  evident  in  the  small  company. 

In  the  last  five  years,  due  to  the  introduction  of  the  IBM  System/3,  and  then  of  the 
IBM  System  32,  over  30,000  companies  (Ref.  30)  have  obtained  their  own  computer  and  do 
their  work  under  their  own  control. 

In  the  case  of  small  units  within  large  organizations,  such  a trend  has  not  yet  developed 
as  strongly.  Although  the  same  logical  forces  are  operating,  they  are  usually  opposed 
by  the  vested  interest  of  the  management  of  centralized  data  processing.  (White,  in 
Ref.  31  describes  a notable  exception.)  Distributed  processing,  described  above,  is  a 
hybrid  system,  partially  decentralized.  It  is  supported  by  the  management  of  centralized 
data  processing  as  long  as  they  can  retain  control  over  it.  However,  since  the  cost  of 
communication  is  not  coming  down  as  rapidly  as  all  other  hardware,  there  is  a strong 
possibility  that  it  may  soon  become  obvious  that  the  remote  processing  done  under  a 
distributed  processing  architecture  will  require  a telephone  link  to  a central  site  only 
in  rare  instances.  I expect  to  see  such  umbilical  cords  cut  with  increasing  frequency 
in  the  next  decade. 

THE  SOFTWARE  PROBLEM 

At  present,  the  cost  of  software  seems  to  have  mixed  effects  on  the  trend  toward  decentral- 
ization. Bookwalter  has  analyzed  this  problem,  as  reported  in  Ref.  32.  On  the  one  hand, 
there  are  several  factors  which  seem  to  indicate  that  software  costs  on  small  decentral- 
ized computers  should  be  greater  than  such  costs  on  large  central  computers,  and  thus 
inhibit  the  trend.  Sophisticated  tools  for  software  development  on  small  computers  are 
available  in  much  less  variety  than  on  large  machines.  Good  operating  systems,  with 
conveniences  for  the  programmer,  are  rare.  With  the  exception  of  RPG  II  on  Systems/3  and 
32,  second  generation  implementation  systems,  such  as  COBOL  compilers  and  only  one  Data 
Base  Management  System,  a small  version  of  TOTAL,  are  just  beginning  to  become  available 
on  a relatively  few  small  machines.  Third  generation  implementation  systems,  necessary 
for  high  productivity,  such  as  Informatics  MARK  IV,  are  available  on  only  one  small 
computer  — the  Microdata  "REALITY"  System,  mentioned  in  Ref.  9. 

On  the  other  hand,  there  are  opposing  factors  which  tend  to  reduce  the  cost  of  software 
development  on  decentralized  minicomputers.  Perhaps  the  most  powerful  factor  is  that 
the  computer  is  specialized.  It  is  not  all  things  to  all  men  — when  best  used,  it 
tends  to  be  dedicated  to  a single  application  or  a group  of  related  applications.  Under 
such  circumstances,  there  is  no  need  for  a sophisticated  operating  system  with  all  its 
mind-boggling  complexity.  (Patrick's  excellent  analysis.  Ref.  33,  discusses  how  system 
complexity  can  increase  costs  by  a factor  of  four.)  Therefore,  the  programming  is 
straightforward  and  uncomplicated.  Another  factor  of  increasing  importance  is  the 
accelerating  trend  toward  the  availability  of  software  products  on  minicomputers.  As 
discussed  at>ove,  software  costs  can  be  reduced  most  dramatically  by  the  use  of  general 
purpose  software  products.  The  System  32  is  better  equipped  with  the  application  products 
than  any  large  machine.  As  these  become  more  available  for  other  minis,  the  prior 
inhibiting  effect  of  software  development  on  the  trend  will  disappear. 

HARDWARE 

The  dramatic  decrease  in  the  cost  of  processing  logic  has  made  possible  a development 
which  has  not  yet  emerged  as  a trend.  But  it  seems  so  logically  inevitable  that  I 
expect  to  see  it  emerge  within  the  next  five  years.  The  concept  addresses  the  software 
problem  by  exploiting  the  low  cost  of  hardware  logic  to  use  existing  software.  I can 
best  illustrate  it  by  describing  a case  in  which  the  technique  produced  dramatic  results. 

Almost  ten  years  ago,  a large  organization  needed  a very  sophisticated  message  switching 
system.  They  selected  a very  powerful,  highly  sophisticated  computer  — the  best  one 
for  the  purpose  available  at  the  time.  The  software  necessary  to  Implement  the  system 
was  a major  development  project.  It  took  a number  of  years  and  cost  several  million 
dollars.  Subsequently,  several  other  organizations  acquired  the  same  hardware,  and  built 


on  the  base  of  the  same  software,  enhancing  it  as  the  years  went  by.  The  replacement 
costs  of  that  software  toaay  would  perhaps  be  $5,000,000;  it  was  by  far  the  best  avail- 
able answer  to  the  requirements.  In  the  early  70 's,  the  hardware  began  to  become 
obsolescent  and  its  cost-effectiveness  declined,  so  that  new  users  were  reluctant  to 
acquire  such  hardware.  New  modern  minis,  as  we  have  seen  above,  could  perform  the  same 
functions  at  far  less  cost.  But  their  use  was  impractical;  the  cost  of  rewriting  the 
software  was  prohibitive . 

So  a concept  was  developed  to  use  such  relatively  inexpensive  hardware  for  processing 
the  existing  software  by  the  well-known  technique  of  hardware  emulation.  A relatively 
inexpensive,  reliable  minicomputer  from  a reputable  manufacturer  was  selected.  Enhance- 
ments to  the  hardware  were  designed,  which  enabled  the  message-switching  software  to  run 
on  the  foreign  machine.  The  cost  of  hardware  development  and  manufacturing  was  less 
than  10%  of  the  cost  of  rewriting  the  software.  The  project  was  successfully  completed, 
and  a substitute  ivas  developed,  highly  cost-effective  by  1972  standards.  By  1980,  such 
a substicute  can  be  developed  for  20%  of  the  cost  of  the  1972  model.  (Ref.  34,35). 

There  is  available  today  a growing  inventory  of  software  products  for  large,  expensive 
computers.  I believe  that  inexpensive  hardware  enhancements  to  modern  minicomputers 
will  soon  enable  them  to  run  this  inventory  of  available  software.  The  most  cost  effec- 
tive approach  will  be  tc  dedicate  a machine  to  a specific  purpose  (See  Ref.  33) . Today 
you  have  on  your  wrist  a computer  dedicated  to  telling  you  what  time  it  is.  Soon  you 
will  buy  a computer  to  do  only  accounts  receivable  or  document  accession,  or  whatever 
your  job  is. 

GROSCH'S  LAW 

In  the  first  two  decades  of  the  computer  age,  from  1950  through  1970,  a number  of 
generations  of  machines  were  developed.  As  each  major  computer  was  offered  for  sale, 
it  could  perform  more  work  for  less  money  than  its  predecessors.  Dr.  H.R.J.  Grosch, 
the  recently  elected  president  of  the  Association  for  Computing  Machinery,  was  the  first 
one  to  popularize  a formal  description  of  this  phenomenon.  Humorously  at  first,  it 
beccime  known  as  Grosch *s  Law,  formulated  as  follows:  "Throughput  capacity  of  a computer 

is  proportional  to  the  square  of  its  price."  A number  of  more  formal  documented  studies 
of  this  phenomenon  have  confirmed  that  the  "Law"  was  indeed  valid  for  the  large  computers 
of  that  era  when  operated  in  batch  mode.  (Ref.  1,7).  (Cynics,  however,  pointed  out,  in 
Ref.  8,  that  the  Law  was  not  fundamental  because  of  the  economics  of  engineering  develop- 
ment, but  only  because  of  IBM  pricing  strategy.) 

In  the  same  era,  there  was  another  phenomenon  occurring.  The  users  of  such  large  machines 
were  making  greater  and  greater  use  of  computers.  More  and  more  computer  applications 
were  being  developed,  as  described  in  Ref.  33. 

Data  processing  management,  responsible  for  selection  and  procurement  of  computing  equip- 
ment, became  well  aware  of  the  "economies  of  scale,"  (Ref. 1,7).  Since  the  demand  for 
capacity  seemed  to  be  growing  without  limit,  there  evolved  a corollary  to  Grosch 's  Law 
that  I call  the  "Dogma  of  Data  Processing."  it  is;  "The  most  economical  way  to  do 
computing  is  to  acquire  the  largest  computer  that  the  enterprise  can  possibly  foresee  a 
need  for;  therefore,  all  computing  in  the  enterprise  must  be  done  in  one  central  computing 
facility."  The  application  of  the  Dogma  made  it  clear  that  an  application  o^  "Engine 
Charlie"  Wilson's  famous  quotation:  "What  is  good  for  General  Motors..."  was  not  bad  for 
General  Motors  Data  Processing  manager!  His  prestige,  power  authority,  and  salary  were 
also  proportional  to  the  size  of  the  installation  that  he  ruled.  Hence,  there  has 
evolved  a strong  "union"  of  data  processing  managers,  who  quote  the  gospel  according 
to  Grosch  in  defense  of  centralization,  and  treat  as  heretics  the  radicals  who  would 
propose  decentralization. 

THE  REPEAL  OF  GROSCH'S  LAW 

As  the  technological  developments  described  earlier  began  to  occur,  the  price- throughput 
relationship  of  Grosch’s  Law  began  to  disappear.  Half  in  jest,  but  with  astounding 
foresight,  Adams  first  headlined  "GROSCH'S  LAW  REPEALED"  in  1962.  (Ref.  36).  On  the 
one  hand,  the  very  large  machines  began  to  encounter  some  dis-economies  of  scale.  As 
more  and  more  work  was  done  on  a single  mainframe,  the  operating  system,  in  order  to 
sort  it  all  out,  got  more  and  more  complicated.  Operating  system  overhead  began  to 
climb,  so  that  less  and  less  useful  work  was  actually  being  done.  Patrick,  in  Ref.  33, 
points  out  the  possibility  of  such  systems  growing  large  enough  to  fall  of  their  own 
weight. 

On  the  other  hand,  as  we  have  seen,  the  new  small  cheap  minicomputers  simply  did  not 
follow  the  law  that  "throughput  capacity  is  proportional  to  the  square  of  the  price." 
Although  I am  unaware  of  any  computations  to  demonstrate  the  fact,  I believe  that  the 
data  since  1965  would  support  Adam's  1962  conjecture  (Ref.  36)  that:  "Throughput  per- 
formance is  proportional  to  the  square  root  of  the  price."  If  his  conjecture  is  true, 
the  cheapest  way  to  get  computing  done  is  to  get  the  smallest  computer  that  can  perform 
a particular  application.  The  $8.95  hand-held  calculator  may  be  an  illustration  of  the 
latter  "law." 
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WHY  DO  WE  USE  COMPUTERS? 

Everything  that  I have  stated  above  is  really  a smoke  screen  — it  obscures  the  rsal 
forces  and  real  trends  which  will  determine  the  shape  of  the  future.  If  you  listen  to 
data  processing  people  with  a critical  ear,  you  will  begin  to  be  appalled  at  the  fact 
that  they  seem  to  believe  that  the  computer  is  an  end  in  itself.  Remember  my  story 
about  the  quarter-inch  drills  and  the  quarter- inch  holes?  The  comparison  is  clear  — 
nobody  wants  computers  — people  only  want  to  get  their  work  done.  If  the  computer  is 
a tool  which  can  help  them  get  their  work  done,  they  ought  to  use  it,  and  will  use  it. 

If  the  computer  gets  in  people's  way,  they  may  be  forced  by  circumstances  to  use  it, 
but  their  productivity  will  be  lower  than  it  could  be.  We  must  never  forget  that  the 
work  is  done  by  people , not  computers. 

OPTIMIZATION  — LOCAL  VERSUS  GLOBAL 

In  most  large  organizations,  data  processing  costs  tend  to  be  somewhere  from  0.5%  to  2% 
of  total  costs.  Salaries  and  other  costs  of  people  are  between  4C%  and  90%  of  total 
costs.  The  efficiency  of  the  total  enterprise  is  the  efficiency  of  the  global  system 
of  people,  doing  the  necessary  work,  assisted  by  computers.  Yet,  at  meetings  of  com- 
puter people,  the  discussions  might  lead  one  to  believe  that  all  the  work  was  done  by 
computers.  Serious  papers  are  published  on  optimizing  the  hardware/software  sub-system, 
leading  the  unwary  to  believe  that  such  optimization  would  solve  the  whole  problem.  Such 
local  optimization  can  be  useful,  but,  as  is  well  known,  must  always  be  subordinated  to 
the  global  optimization  of  the  real  problem,  the  total  system. 

THE  EFFICIENCY  OF  PEOPLE 

Just  suppose  that  typewriters  had  never  been  invented.  All  the  secretaries  in  your 
office  are  producing  letters  and  reports  by  writing  them  out  very  neatly  in  longhand 
with  bail-point  pens.  Now  IBM  makes  a dramatic  announcement  — it  has  invented  the 
typewriter!  Next  morning,  at  every  large  corporation,  an  IBM  typewriter  salesman  has  an 
appointment  with  the  Vice  President  for  Office  Services.  Let  us  see  what  happens,  for 
example,  at  the  General  Electric  Corporation.  The  salesman  suggests  that  GE  buy  a $600 
typewriter  for  each  secretary.  This  is  a revolutionary  proposal.  Purchasing  is  called 
in.  Internal  Consulting  is  summoned  to  a meeting.  Operations  Research  is  charged  with 
investigating  the  concept,  and  conducting  a feasibility  study.  The  results  show  that 
the  average  use  of  a typewriter  by  a secretary  would  be  1.1873  hours  per  day,  and  that 
the  productivity  of  secretaries  would  increase  by  325.26%.  It  is  strongly  recommended 
that  the  corporation  convert  to  typewriters.  It  is  recommended  that  enough  typewriters 
be  procured  so  that  each  is  loaded  4.7492  hours  per  day.  Allowing  for  down  time  and 
assuming  that  most  overloads  can  be  handled  by  over-time,  each  four  secretaries  will 
share  one  typewriter. 

The  recommendation  is  adopted.  The  typewriters  are  delivered,  and  training  begins. 

Soon  all  secretaries  are  mechanically  proficient;  Purchasing  cancels  all  orders  for 
ball-point  pens;  and  the  system  is  cut  over  to  document  production  by  typewriter  only. 
Productivity  is  very  low  the  first  week.  It  is  worse  the  second  week,  worse  the  third, 
and  by  the  end  of  the  month,  the  backlog  of  letters  and  reports  has  reached  alarming 
proportions.  Most  executives  are  spending  a good  deal  of  their  time  consoling  tearful 
secretaries  who  complain  that  they  cannot  get  their  work  done  because  they  cannot  get 
access  to  a typewriter.  An  emergency  meeting  is  held.  Only  two  viable  alternatives 
present  themselves  — go  back  to  the  ball-point  pens,  or  get  a typewriter  for  each 
secretary!  Operations  Research  does  a fast  study,  and  concludes  that  spending  four  times 
as  much  for  the  typewriters  will  be  paid  for  many  times  over  by  the  increase  in  produc- 
tivity. The  recommendation  is  accepted,  the  secretaries  live  happily  ever  after,  and 
General  Electric  increases  its  dividends. 

The  moral  of  the  fairytale  is  clear.  Efficiency  of  people  can  vary  over  a wide  spectrum. 
The  productivity  of  programmers  has  been  studied  extensively;  no  general  conclusion  was 
possible  because  productivity  varied  by  a factor  of  ten  between  different  people,  and  even 
when  measuring  the  same  person  at  different  times.  All  data  on  the  efficiency  of  people 
show  that  it  can  seriously  be  Impaired  by  frustration  — "Nobody  lets  me  get  my  work  done 
without  interference."  Conversely,  productivity  seems  to  be  at  its  highest  when  the 
worker  has  full  control  of  the  tools  necessary  to  do  a job.  Many  of  you  have  had  exper- 
iences with  central  data  processing  that  parallel  those  that  the  secretaries  had  with 
the  shared  typewriters.  You  complain  of  the  service.  Data  processing  management  writes 
an  authoritative  study  showing  that  the  service  is  excellent  and  very  cost-effective 
(implying  that  you  are  the  money-waster) . You  fire  back  a memo  documenting  how  bad  it 
is.  In  the  meantime,  the  work  suffers.  What  would  happen  if  you  had  control  of  your 
own  computer?  If  anything  went  wrong,  you  would  look  around  for  someone  to  blame,  and 
see  no  one  but  your  own  unit.  You  would  roll  up  your  sleeves  and  solve  the  problem. 

Of  course,  the  moral  of  my  fairytale  would  not  be  applicable  if  typewriters  cost  $600,000 
apiece,  instead  of  $600.  Until  ten  years  ago,  computers  were  too  expensive  to  consider 
decentralizing  control  of  them.  But  I submit  that  today  their  prices,  including  software, 
are  approaching  the  low  level  where  it  is  foolish  to  sub-optimize  computing  costs  at  the 
expense  of  the  costs  of  the  people  who  really  do  the  work. 
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THE  PRINCIPLE  OF  DECENTPALIZED  COMPUTING 


t .. 


I believe  that  we  have  entered  an  era  where  guidance  in  acquiring  computing  power  can 
be  derived  from  a new  Principle  of  Decentralization: 

"If  an  organizational  group,  smaller  than  30  people,  requires  computer 
assistance,  it  is  better  for  the  total  enterprise  that  those  people  have 
exclusive  use  of  their  ow..  computer  — provided  that  the  computer,  big 
enough  to  dc  the  job  properly,  will  be  loaded  to  over  10%  of  its  capacity." 

The  underlined  words  are  vital  uo  the  application  of  the  principle. 

The  group  must  be  smaller  than  30  people,  a number  that  should  decrease  in  the  future 
as  the  cojts  of  the  computer  decreas';.  It  was  chosen  large  enough  so  that  the  cost  of 
a dedicated  computer  would  be  less  than  5%  of  the  total  per.;onnel  costs.  (Surely, 
adding  a computer  should  improve  the  efficiency  or  an  operating  unit  by  5%!)  On  the 
other  hand,  if  the  group  is  larger  than  30,  it  probably  will  have  more  than  one  kind 
of  work  to  do,  and  thus,  the  computer  would  beoin  to  be  v.sed  Cor  more  than  one  appli- 
cation. Dorn  shows,  in  Ref.  37  that  when  this  happens,  you  immediately  have  a small 
central  computing  facility,  with  all  its  attendant  evils,  and  you  lose  many  of  the 
advantages  of  decantraliza< ion.  Thus,  the  importance  of  the  words,  "exclusive  use," 
which  cannot  happen  unless  the  group  is  smaller  than  30  people. 

The  computer  must  be  big  enough  to  do  the  job  properly.  There  are  many  problems  which 
simply  cannot  be  handled  by  today's  small  computers.  Nuclear  reactor  design,  numerical 
prediction  of  global  weather,  maintenanca  of  reservatior s a year  ahead  for  all  of  the 
airline  seats  in  the  world,  instant  retrieval  from  massive  data  bases,  are  B^IG 
problems.  Such  applications  are  only  feasible  today  on  a computer  so  expensive  that 
is  only  practical  if  it  is  shared  by  many  groups  of  users.  I venture  no  predictions 
as  to  the  speed  w' th  which  technological  improvements  will  catch  up  with  these  applica- 
tions. Up  until  recently,  I believed  that  data  bases  would  be  the  last  application  to 
become  adaptable  to  a dedicated  small  computer.  But  a careful  analysis  shows  that  such 
is  not  always  the  case.  For  example,  data  bases  can  frequently  be  segmented  and  dis- 
tributed to  the  place  v^herc  the  segment  i»  used.  'Instant"  updating  may  be  unnecessary; 
Ref.  28  notes  that,  a large  insurance  policy  file  has  been  distributed  among  branch 
offices,  each  of  which  services  a distinct  small  group  of  policy  holders. 

1 have  recently  become  familiar  with  an  application  involving  a medium.-si zed  bibliographic 
data  base  — the  catalog  cards  of  the  Library  of  Congress.  Informatics  has  developed  a 
system,  called  KINI-MARC,  to  increase  the  productivity  of  the  cataloging  staff  of  a 
library.  Such  a library  uses,  as  its  source  information,  the  data  issued  in  MARC  format 
for  each  cook  by  the  Library  of  Congress.  However,  it  is  not  appropriate  to  use  the 
information  in  its  raw  form  for  the  particular  library’s  own  catalog,  which  is  unique 
because  of  the  unique  needs  of  the  l.ibrary.  The  cataloging  staff  eliminates  some  of  the 
MARC  data,  and  adds  additional  data.  The  Informatics  MINI-MARC  system  consists  of  an 
inexpensive  minicomputer  processor,  tv;o  floppy-disk  drives,  a cathode  ray  tube  terminal 
with  keyboard,  and  a typewriter  for  hard  copy  output.  A part  of  the  system  is  a data 
base  of  all  such  Library  of  Congress  MARC  data  on  400  diskettes.  Informatics  provides 
a subscription  service  to  keep  the  data  base  up  to  date  as  it  is  issued.  The  cataloger 
selects  the  proper  diskette  and  mounts  it  on  one  drive.  Using  the  cathode  ray  tube  and 
its  keyboard,  he  reads  the  information,  deletes  from  it,  adds  to  it,  puts  it  into 
exactly  the  form  that  he  wants  in  his  catalog,  and  writes  that  on  a diskette  on  the 
other  drive.  Thus,  he  has  two  data  bases  at  his  disposal  in  the  computer  — the  original 
catalog  card  and  his  own  library's  catalog.  The  diskettes  containing  his  own  library's 
catalog  are  periodically  processed  by  a large  computer  at  a central  site  to  produce  the 
printed  catalogs  his  library  needs.  The  entire  system  can  be  obtained  for  under  $50,000, 
including  all  the  software  to  make  it  work.  On  a lease  basis,  the  monthly  cost  would 
not  be  greater  than  the  salary  of  one  cataloger.  Two  such  systems  are  expected  to  be 
installed  in  a Government  library  in  the  near  future. 

The  last  part  of  the  principle  of  decentralization  speaks  to  the  utilization  of  such  a 
dedicated  computer.  It  conjectures  that  the  computer  should  be  loaded  to  over  10%  of 
its  capacity.  Logically,  of  course,  the  loading  is  immaterial,  as  long  as  the  pro- 
ductivity of  the  group  is  increased  enough  to  pay  for  the  computer.  However,  1 have 
added  this  requirement,  without  any  analytical  justification  for  the  selection  of  the 
10%  level  of  use,  because  of  my  belief  that,  if  it  is  to  be  successful,  the  assistance 
of  the  computer  to  the  group  should  not  be  for  some  trivial  application,  but  should 
participate  in  the  main-stream  activity  of  the  group. 

CONCLUSIONS 


It  is  now  clear  that  Grosch's  Law  has  been  repealed  by  technological  advances.  IT  IS  NO 
LONGER  TRUE  THAT  THE  MOST  COST-EFFECTIVE  WAY  TO  DO  DATA  PROCESSING  IS  ON  A LARGE  CENTRAL 
COMPUTER.  However,  the  current  investment  of  large  amounts  of  money  in  central  data 
processing  installations  and  in  the  organizations  built  to  support  them,  and  especially 
the  vested  interests  of  their  management  and  of  IBM's  manufacturing  capability,  will 
delay  for  many  years  the  inevitable  growth  of  decentralization. 

The  new  "Principle  of  Decentralization"  is  a useful  guide  to  evaluating  whether  to  use 
central  computing  or  not. 

It  is  likely  that  there  is  a big  future  for  standard  software  application  products, 
running  on  small  dedicated  computers  enhanced  by  emulation  hardware. 
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14.  Abstract 

The  ultimate  test  of  any  information  system  is  whether  or  not  the  user  can  quickly,  easily 
and  comprehensively  obtain  relevant  information  stored  therein.  As  data  bases  expand 
and  multiply,  retrieval  technology  must  advance  if  the  user  is  to  benefit  fully  from  this 
increased  information  potential.  For  this  reason,  three  of  the  four  sessions  of  this  meeting, 
covering  minicomputers,  bibliographic  data  base  sharing  and  future  technological  advances, 
explore  in  some  depth  areas  critically  important  to  the  user’s  need  to  access  an  increasing 
number  of  scientific  and  technical  information  collections  with  greater  ease  and  efficiency. 


The  other  session  was  somewhat  differently  oriented  and  addresses  numerical  data  bases, 
a relatively  new  and  rapidly  expanding  addition  to  information  systems.  Presentations 
include  not  only  the  development  of  such  data  collections  but  several  applications  as  well. 
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